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Preface 


The aim of this book is to present a lively and palatable introduction to financial 
option valuation for undergraduate students in mathematics, statistics and related 
areas. Prerequisites have been kept to a minimum. The reader is assumed to have a 
basic competence in calculus up to the level reached by a typical first year mathe- 
matics programme. No background in probability, statistics or numerical analysis 
is required, although some previous exposure to material in these areas would un- 
doubtedly make the text easier to assimilate on first reading. 

The contents are presented in the form of short chapters, each of which could 
reasonably be covered in a one hour teaching session. The book grew out of a final 
year undergraduate class called The Mathematics of Financial Derivatives that I 
have taught, in collaboration with Professor Xuerong Mao, at the University of 
Strathclyde. The class is aimed at students taking honours degrees in Mathematics 
or Statistics, or joint honours degrees in various combinations of Mathematics, 
Statistics, Economics, Business, Accounting, Computer Science and Physics. In 
my view, such a class has two great selling points. 


e From a student perspective, the topic is generally perceived as modern, sexy and likely 
to impress potential employers. 

e From the perspective of a university teacher, the topic provides a focus for ideas from 
mathematical modelling, analysis, stochastics and numerical analysis. 


There are many excellent books on option valuation. However, in preparing 
notes for a lecture course, I formed the opinion that there is a niche for a single, 
self-contained, introductory text that gives equal weight to 


e applied mathematics, 
e stochastics, and 
e computational algorithms. 


The classic applied mathematics view is provided by Wilmott, Howison and 
Dewynne’s text (Wilmott et al., 1995). My aim has been to write a book at a sim- 
ilar level with a less ambitious scope (only option valuation is considered), less 
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emphasis on partial differential equations, and more attention paid to stochastic 
modelling and simulation. 
Key features of this book are as follows. 


(i) Detailed derivation and discussion of the basic lognormal asset price model. 

(ii) Roughly equal weight given to binomial, finite difference and Monte Carlo methods. 
In particular, variance reduction techniques for Monte Carlo are treated in some detail. 

(iii) Heavy use of computational examples and figures as a means of illustration. 

(iv) Stand-alone MATLAB codes, with full listings and comprehensive descriptions, that 
implement the main algorithms. The core text can be read independently of the codes. 
Readers who are familiar with other programming languages or problem-solving en- 
vironments should have little difficulty in translating these examples. 


In a nutshell, this is the book that I wish had been available when I started to 
prepare lectures for the Strathclyde class. 

When designing a text like this, an immediate issue is the level at which stochas- 

tic calculus is to be treated. One of the tenets of this book is that 
rigorous, measure-theoretic, stochastic analysis, although beautiful, is hard and it is 
unrealistic to ask an undergraduate class to pick up such material on the fly. Monte 
Carlo-style simulation, on the other hand, is a relatively simple concept, and well- 
chosen computational experiments provide an excellent way to back up heuristic 
arguments. 
Hence, the approach here is to treat stochastic calculus on a nonrigorous level 
and give plenty of supporting computational examples. I rely heavily on the Cen- 
tral Limit Theorem as a basis for heuristic arguments. This involves a deliberate 
compromise — convergence in distribution must be swapped for a stronger type of 
convergence if these arguments are to be made rigorous — but I feel that erring on 
the side of accessibility is reasonable, given the aims of this text. 

In fact, in deriving the Black-Scholes partial differential equation, I do not make 
explicit reference to It6’s Lemma. I decided that a heuristic derivation of It6’s 
Lemma in a general setting followed by a single application of the lemma in one 
simple case makes less pedagogical sense than a direct ‘in situ’ heuristic treatment, 
a decision inspired by Almgren’s expository article (Almgren, 2002). I hope that 
at least some undergraduate readers will be sufficiently motivated to follow up on 
the references and become exposed to the real thing. 

You can get a feeling for the contents of the book by skimming through the 
outline bullet points that appear at the start of each chapter. Many of the later 
chapters can be read independently of each other, or, of course, omitted. 

Exercises are given at the end of each chapter. It is my experience that active 
problem solving is the best learning tool, so I strongly encourage students to make 
use of them. I have used a starring system: one star for questions whose solution 
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is relatively easy/short, rising to three stars for the hardest/longest questions. Brief 
solutions to the odd-numbered exercises are available from the book website given 
below. This leaves the even-numbered questions as a teaching resource. Certain 
questions are central to the text. I have tried to ensure that these come up in the 
odd-numbered list, in order to aid independent study. 

A short, introductory treatment like this can only scratch the surface. Hence, 
each chapter concludes with a Notes and references section, which gives my own, 
necessarily biased, hints about important omissions. References can be followed 
up via the References section at the end of the book. 

Scattered at the end of each chapter are a few quotes, designed to enlighten and 
entertain. Some of these reinforce the ideas in the text and others cast doubt on 
them. Mathematical option valuation is a strange business of sophisticated analysis 
based on simple models that have obvious flaws and perhaps do not merit such 
detailed scrutiny. When preparing lecture notes, I have found that authoritative, 
pithy quotes are a particularly powerful means to highlight some of this tension. 
I have an uneasy feeling that some Strathclyde students spent more time perusing 
the quotes than the main text, so I have aimed to make the quotes at least form 
a reasonable mini-summary of the contents. Most quotes relate directly to their 
chapter, but a few general ones have been dispersed throughout the book on the 
grounds that they were too good to leave out. 

A website for this book has been created at www.maths.strath.ac.uk/~aas96106/ 
option_book.html. It includes the following. 


e The MATLAB codes listed in the book. 

e Outline solutions to the odd-numbered exercises. 

e Links to the websites mentioned in the book. 

e Colour versions of some of the figures. 

e A list of corrections. 

e Some extra quotes that did not make it into the book. 


I am grateful to several people who have influenced this book. Nick Higham 
cast a critical eye over an early draft and made many helpful suggestions. Vicky 
Henderson checked parts of the text and patiently answered a number of ques- 
tions. Petter Wiberg gave me access to his MATLAB files for processing stock 
market data. Xuerong Mao, through animated discussions and research collabo- 
ration, has enriched my understanding of stochastics and its role in mathematical 
finance. Additionally, five anonymous reviewers provided unbiased feedback. In 
particular, one reviewer who was not in favour of the nonrigorous approach to 
stochastic analysis in this book was nevertheless generous enough to provide de- 
tailed comments that allowed me to improve the final product. Finally, three years’ 
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worth of Strathclyde honours students have helped to shape my views on how to 
present this material to a wide audience. 


MATLAB programs 


I firmly believe that the best way to check your understanding of a computational 
algorithm is to examine, and interactively experiment with, a real program. For 
this reason, I have included a Program of the Chapter at the end of every chapter, 
followed by two programming exercises. Each program illustrates a key topic. 
They can be downloaded from the website previously mentioned. 

The programs are written in MATLAB.! I chose this environment for a number 
of reasons. 


e It offers excellent random number generation and graphical output facilities. 

e It has powerful, built-in, high-level commands for matrix computations and statistics. 

e It runs on a variety of platforms. 

e It is widely available in mathematics and computer science departments and is often 
used as the basis for scientific computing or numerical analysis courses. Students may 
purchase individual copies at a modest price. 


I wrote the programs with accuracy and clarity in mind, rather than efficiency 
or elegance. I have made quite heavy use of MATLAB’s vectorization facili- 
ties, where possible working with arrays directly and eschewing unnecessary for 
loops. This tends to make the codes shorter, snappier and less daunting than alter- 
natives that operate on individual array components. Meaningful comments have 
been inserted into the codes and a ‘walkthrough’ commentary is appended in each 
case. Those walkthroughs provide MATLAB information on a just-in-time basis. 
For a comprehensive guide to MATLAB, see (Higham and Higham, 2000). 

I have not made use of any of the toolboxes that are available, at extra cost, to 
MATLAB users. This is because (a) the emphasis in the book is on understand- 
ing the underlying models and algorithms, not on the use of black-box packages, 
and (b) only a small percentage of MATLAB users will have access to toolboxes. 
However, those who wish to perform serious option valuation computations in 
MATLAB are advised to investigate the toolboxes, especially those for Finance, 
Statistics, Optimization and PDEs. 

Readers with some experience of scientific computing in languages such as 
Java, C or FORTRAN should find it relatively easy to understand the codes. Those 
with no computing background may need to put in more effort, but should find the 
process rewarding. 


1 MATLAB isa registered trademark of The MathWorks, Inc. 
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MATLAB is a commercial software product produced by The Mathworks, 
whose homepage is at www.mathworks.com/. 

Let me re-emphasize that these programs are entirely stand-alone; the book can 
be read without reference to them. However, I believe that they form a major ele- 
ment — if you understand the programs, you understand a big chunk of the material 
in this book. 


Disclaimer of warranty 


We make no warranties, express or implied, that the programs contained in this 
volume are free of error, or are consistent with any particular standard of mer- 
chantability, or that they will meet your requirements for any particular applica- 
tion. They should not be relied on for solving a problem whose incorrect solution 
could result in injury to a person or loss of property. If you do use the programs in 
such a manner, it is at your own risk. The author and publisher disclaim all liability 
for direct or consequential damages resulting from your use of the programs. 


1 
Options 


OUTLINE 


e European call and put options 
e payoff diagrams 
e how and why options are traded 


1.1 What are options? 


Throughout the book we use the term asset to describe any financial object whose 
value is known at present but is liable to change in the future. Typical examples are 


e shares in a company, 
e commodities such as gold, oil or electricity, 
e currencies, for example, the value of US $100 in euros. 


We will have much to say about assets in subsequent chapters, but let us get 
straight to the point and define an option. 


Definition A European call option gives its holder the right (but not the obli- 
gation) to purchase from the writer a prescribed asset for a prescribed price at a 
prescribed time in the future. © 


The prescribed purchase price is known as the exercise price or strike price, and 
the prescribed time in the future is known as the expiry date. 

To illustrate the idea, suppose that, today, your friend Professor Smart (the 
writer) writes a European call option that gives you (the holder) the right to buy 
100 shares in the International Business Machines (IBM) Corporation for $1000 
three months from now. After those three months have elapsed, you would then 
take one of two actions: 


(a) if the actual value of 100 IBM shares turns out to be more than $1000 you would exer- 
cise your right to buy the shares from Professor Smart — because you could immediately 
sell them for a profit. 
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(b) if the actual value of 100 IBM shares turns out to be less than $1000 you would not 
exercise your right to buy the shares from Professor Smart — the deal would not be 
worthwhile. 


Because you are not obliged to purchase the shares, you do not lose money (in 
case (a) you gain money and in case (b) you neither gain nor lose). Professor 
Smart, on the other hand, will not gain any money on the expiry date, and may 
lose an unlimited amount. To compensate for this imbalance, when the option is 
agreed (today) you would be expected to pay Professor Smart an amount of money 
known as the value of the option. 

The direct opposite of a European call option is a European put option. 


Definition A European put option gives its holder the right (but not the obliga- 
tion) to sell to the writer a prescribed asset for a prescribed price at a prescribed 
time in the future. © 


The key question that we address in this book is: how much should the holder 
pay for the privilege of holding an option? In other words, how do we compute a 
fair option value? 

To answer this question we have to devise a mathematical model for the be- 
haviour of the asset price, come up with a precise interpretation of ‘fairness’ and 
do some analysis. These steps, which take up the next seven chapters, will lead 
us to the celebrated Black-Scholes formula. Looking at practical issues and more 
exotic options will then draw us into computational algorithms, which take up the 
bulk of the remainder of the book. 

The rest of this chapter is spent on a brief review of how and why options are 
traded. 


1.2 Why do we study options? 


Options have become extremely popular; so popular that in many cases more 
money is invested in them than in the underlying assets. Why do they get so much 
attention? There are two good reasons. 


(1) Options are extremely attractive to investors, both for speculation and for hedging. 
(2) There is a systematic way to determine how much they are worth, and hence they can 
be bought and sold with some confidence. 


Point (2) is the main subject of this book. To illustrate point (1), if you believe 
that Microsoft Corporation shares are due to increase then you may speculate by 
becoming the holder of a suitable call option. Typically, you can make a greater 
profit relative to your original payout than you would do by simply purchasing the 
shares. On the other hand, if you are the owner of an American company that is 
committed to purchasing a factory in Germany for an agreed price in euros in three 
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months’ time, then you may wish to hedge some risk by taking out an option that 
makes some profit in the event that the US dollar drops in value against the euro. 

A further attraction is that by combining different types of option, an investor 
can take a position that reaps benefits from various types of asset behaviour. To 
understand this, it is useful to visualize options in terms of payoff diagrams. 

We let E denote the exercise price and S(T ) denote the asset price at the expiry 
date. (Of course, S(T) is not known at the time when the option is taken out.) In 
later chapters, S(t) will be used to denote the asset price at a general time t, and T 
will denote the expiry date. At expiry, if S(T) > E then the holder of a European 
call option may buy the asset for E and sell it in the market for S(T), gaining an 
amount S(T) — E. On the other hand, if E > S(T) then the holder gains nothing. 
Hence, we say that the value of the European call option at the expiry date, denoted 
by C, is 


C =max(S(T) — E, 0). (1.1) 


Plotting S(T) on the x-axis and C on the y-axis gives the payoff diagram in 
Figure 1.1. Consider now a European put option. If, at expiry, E > S(T) then 
the holder may buy the asset at S(T ) in the market and exercise the option by sell- 
ing it at E, gaining an amount E — S(T). On the other hand, if S(T) > E then 
the holder should do nothing. Hence, the value of the European put option at the 
expiry date, denoted by P, is 


P =max(E — S(T), 0). (1.2) 


The corresponding payoff diagram is plotted in Figure 1.2. Because of their shape, 
the piecewise linear payoff curves in Figures 1.1 and 1.2 are sometimes referred to 
as (ice) hockey sticks. 


Call option 


E ST) 


Fig. 1.1. Payoff diagram for a European call. Formula is C = max(S(T) — 
E, 0). 
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Put option 
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Fig. 1.2. Payoff diagram for a European put. Formula is P = max(E — 
S(T), 0). 


Now we may plot payoff diagrams for combinations of options. For example, 
suppose you hold a call option and a put option on the same asset with the same 
expiry date and the same strike price, E. Then the overall value at expiry is the sum 
of max(S(T) — E, 0) and max(£ — S(T), 0), which is equivalent to |S(T) — E], 
see Exercise 1.2. This combination goes under the unfortunate name of a bottom 
straddle. The holder of a bottom straddle benefits when the asset price at expiry is 
far away from the strike price — it does not matter whether the asset finishes above 
or below the strike. 

Another possibility is to hold a call option with exercise price EF, and, for the 
same asset and expiry date, to write a call option with exercise price E2, where 
E2 > E,. At the expiry date, the value of the first option is max($(7) — E1, 0) and 
the value of the second is — max($(T) — E2, 0). Hence, the overall value at expiry 
is max(S (T) — F,,0) — max(S (T) — E2, 0). The corresponding payoff diagram 
is plotted in Figure 1.3. This combination gives an example of a bull spread. We 
see from the figure that the holder of such a spread benefits when the asset price 
finishes above E4, but gets no extra benefit if it is above E2. 


1.3 How are options traded? 


Options can be traded on a number of official exchanges. The first of these, the 
Chicago Board Options Exchange (CBOE), started in 1973 and there are more 
than 50 throughout the world in 2004. Most exchanges operate through the use 
of market makers, individuals who are obliged to buy or sell options whenever 
asked to do so. On request, the market maker will quote a price for the option. 
More precisely, two prices will be quoted, the bid and the ask. The bid is the 
price at which the market maker will buy the option from you and the ask is the 
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Bull spread 


E- E; 


E E 
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Fig. 1.3. Payoff diagram for a bull spread. Formula is B = max(S(T) — 
E,,0) — max(S(T) — Eo, 0). 


price at which the market maker will sell it to you. The bid is lower than the ask, 
because the market maker needs to make a living. The difference between the ask 
and the bid is known as the bid—ask spread. Typically, market makers aim to make 
their profits from the bid—ask spread and do not wish to speculate on the market; 
they seek to hedge away their risks using the type of technique that is covered in 
Chapters 8 and 9. 

Options are also traded directly between large financial institutions — so called 
over-the-counter or OTC deals. These options often have nonstandard features that 
are tailored to the particular needs of the parties involved. 

The Financial Times newspaper tabulates the prices of some options that may 
be traded on the London International Financial Futures & Options Exchange 
(LIFFE). For example, the issue from Friday, 19 September 2003 included the 
information 


Calls Puts 
Option Oct Nov Dec Oct Nov Dec 
Royal Bk Scot. 1600 67.0 92.5 109.5 29.0 49.0 62.5 
(1634.0) 1700 19.5 43.5 59.0 82.0 100.0 112.5 


The number 1634.0 is the closing price of The Royal Bank of Scotland’s shares 
from the previous day. The numbers 1600 and 1700 are two exercise prices, in 
pence. (The Financial Times lists information for these exercise prices only, but the 
exchange offers options for many other exercise prices.) The numbers 67.0, 92.5, 
109.5 are the prices of the call options with exercise price 1600 and expiry dates in 
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Oct, Nov and Dec, respectively (more precisely, for 18:00 on the third Wednesday 
of each month). Similarly, 19.5, 43.5, 59.0 are the prices of call options with exer- 
cise price 1700 for those expiry dates. The numbers 29.0, 49.0, 62.5 give the prices 
of put options with exercise price 1600 and expiry dates in Oct, Nov and Dec, and 
82.0, 100.0, 112.5 are the corresponding put option prices for exercise price 1700. 
The numbers quoted lie somewhere between the bid and the ask. 

The Wall Street Journal publishes option data in a similar form. Many providers 
offer electronic data access, with some basic information being available in the 
public domain; see Section 5.5 for some pointers. 


1.4 Typical option prices 


Figure 1.4 shows some prices for call and put options on IBM shares that were 
available on the New York Stock Exchange on 13 October 2002. Some of the data 
from Figure 1.4 is repeated in a slightly different format in Figure 1.5. The prevail- 
ing asset price, more precisely the price paid at the most recent trade, was 74.25, 
marked ‘Now’ in Figure 1.4. Option prices were available for a range of strike 
prices and expiry times. These prices relate to American, rather than European, 
options. Americans are introduced in Chapter 18. For the moment we note that an 
American call has the same value as a European call (assuming that no dividends 
are paid), and an American put has a higher value than a European put. 

In this example, for a given expiry time, the call option price decreases as the 
strike price increases. This is perfectly reasonable. Increasing the strike price has a 
negative effect on the payoff and hence reduces the call option’s worth. Similarly, 
the put price increases with increasing strike price. It can also be observed from 
the figures that, for a given strike price, both the call and the put option prices 


Call 


Strike 


Fig. 1.4. Market values for IBM call and put options, for a range of strike prices 
and times to expiry. 
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Fig. 1.5. Market values for IBM call (left) and put (right) options, for a range of 
strike prices and times to expiry. This displays a subset of the data in Figure 1.4. 


increase when the time to expiry increases. This behaviour is generic for European 
call options, as we will see in Section 2.6. 


1.5 Other financial derivatives 


European call and put options are the classic examples of financial derivatives. The 
term derivative indicates that their value is derived from the underlying asset — it 
has nothing to do with the mathematical meaning of a derivative. This book focuses 
exclusively on options. We will develop our mathematical analysis with European 
options in mind, and in later chapters we will introduce American and other more 
exotic options. 


1.6 Notes and references 


There are many introductory texts that explain how stock markets operate; see, for 
example, Dalton (2001); Walker (1991). Chapter 6 of Hull (2000) is also a good 
source of basic practical information about option trading, including 


e what range of expiry dates and exercise prices are typically offered, 
e how dividends and stock splits are dealt with, and 
e how money and products actually change hands. 


Section 5.5 gives the web pages of some stock exchanges. 


EXERCISES 
1.1. x Insert the word ‘rise’ or ‘fall’ to complete the following sentences: 


The holder of a European call option hopes the asset price will... 
The writer of a European call option hopes the asset price will... 


8 Options 


The holder of a European put option hopes the asset price will... 
The writer of a European put option hopes the asset price will... 


1.2. * Convince yourself that max(S(T) — E,0) + max(£ — S(T), 0) is equiv- 
alent to |S (T) — E| and draw the payoff diagram for this bottom straddle. 

1.3. x» Suppose that for the same asset and expiry date, you hold a European 
call option with exercise price E; and another with exercise price E3, where 
E3 > E; and also write two calls with exercise price E? := (E1 + E3)/2. 
This is an example of a butterfly spread.' Derive a formula for the value of 
this butterfly spread at expiry and draw the corresponding payoff diagram. 

1.4. x The holder of the bull spread with payoff diagram in Figure 1.3 would like 
the asset price on the expiry date to be at least as high as F2, but, if it is, 
the holder does not care how much it exceeds E2. Make similar statements 
about the holders of the bottom straddle in Exercise 1.2 and the butterfly 
spread in Exercise 1.3. 


1.7 Program of Chapter 1 and walkthrough 


Our first MATLAB program uses basic plotting commands to draw a bull spread payoff diagram, as 
shown in Figure 1.3, for particular parameters E] and Ey. The program is called ch01 and is stored in 
the file ch01 .m. It is listed in Figure 1.6. The program is run by typing ch01 at the MATLAB prompt. 
The first three lines begin with the symbol % and hence are comment lines. These lines are ignored 
by MATLAB, they are used to provide information to humans who are reading through the code. 
Comment lines may be inserted anywhere, but those at the start of a code have a special property — 
typing help ch01 causes the information 


CHO1 Program for chapter 1 
Plots a simple payoff diagram 


to be echoed to the user. It is customary for the first comment line to begin with the name of the file 
in capital letters, even though the file itself has a lower case name. 

The first command, clf, clears the current figure window, so that any previous graphical output 
is removed. The lines E1 = 2; and E2 = 4; are assignment statements. Variables E1 and E2 are 
automatically created and given those values. The semi-colon at the end of each line causes output 
to be suppressed. Without those semi-colons, the information 


Ei = 2 
E2 = 4 


would be displayed on your screen. The line S = linspace(0,6,100) sets up a one-dimensional 
array S with 100 components, equally spaced between 0 and 6. This could be confirmed after run- 
ning the program by typing S at the MATLAB prompt. The command max (S-E1,0) creates a one- 
dimensional array whose ith entry is the maximum of S(i)-E1 and 0. Note that MATLAB is happy 
to mix arrays and scalars, and will apply the max function in a componentwise manner. Overall 


1 Serve with warm toast. 
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the line B = max(S-E1,0) - max(S-E2,0); creates a one-dimensional array B of payoff values 
corresponding to S. 


%CHO1 Program for chapter 1 
% 
% Plots a simple payoff diagram 


clf 
E1=2; 
E2 = 4; 


S = linspace(0,6,100); 

B = max(S-E1,0)-max(S-E2,0); 
plot(S,B) 

ylim([0,3]) 


xlabel(’S’) 

ylabel(’B’) 

title(Bull Spread Payoff’) 
grid on 


Fig. 1.6. Program of Chapter 1: chO1.m. 


We then plot the payoff diagram with plot (S,B). By default, MATLAB chooses the range for the 
axes, the location of the axis tick marks, the colour and type of the line, and many other features. 
These may be altered with extra commands or via the menu-driven toolbars in the figure window. 
We have specified ylim( [0,3] ), which overrides the y-axis limits that MATLAB would otherwise 
choose automatically. Axis labels and a title are produced by xlabel(’S’), ylabel(’B’) and 
title(’Bull Spread Payoff’). The final command, grid on, causes horizontal and vertical 
dotted reference lines to appear in the plot. Running the program, that is, typing ch01 at the prompt, 
puts a picture similar to Figure 1.3 in a pop-up figure window. 

Typing help linspace, help max, help plot, etc., at the command line gives more informa- 
tion about those functions, and MATLAB’s online documentation, roused by typing doc, forms a 
hypertext style manual. 


PROGRAMMING EXERCISES 
P1.1. Use the input command to produce a variant of chO1 that allows E1 and 
E2 to be specified by the user. 


P1.2. Create a program that plots the payoff diagram for a butterfly spread, as 
described in Exercise 1.3. 


Quotes 


Because the action is faster and the margins thinner — five percent down 
will buy you a futures contract on the DAX 30 in Frankfurt, 
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the CAC 40 in Paris, the FTSE 100 in London, the Nikkei 225 in Tokyo, 
or the Standard & Poor’s 500 in New York — trading in derivatives now swamps 
the markets on which they depend. 
THOMAS A. BASS (Bass, 1999) 


If you believe an asset will rise in price, 
then you may buy a call option to capture a very large potential gain 
with a small investment; 
however, if your belief is wrong 
then you may very easily lose your entire option investment. 
ROBERT ALMGREN (Almgren, 2002) 


Imagine visiting your local used-car dealer to sell him your old Ford. 
He kicks the tires, points to a dent in the fender, 
and offers you a hundred bucks. 
Suppose the following day you are tempted to go back 
and buy your Ford off the lot. 
The dealer will point to the low mileage and tell you 
that he can’t let the car go for less than two hundred dollars. 
This is the difference between the bid, buying price, and the ask, selling price. 
THOMAS A. BASS (Bass, 1999) 


The Pterodactyl is very rarely encountered in real trading. 
The technicians may, however, wish to know 
that it consists of a spread of traditional butterflies. ... 
The use of this position is not recommended unless the author needs a new car. 
A. L. H. SMITH (Smith, 1986) 


Recent history is replete with examples of derivatives trading gone awry. 
PHILIP MCBRIDE JOHNSON (Johnson, 1999) 


Winter, spring, summer or fall, 
all you have to do is call... 
CAROL KING, You’ ve Got a Friend, EMI Music Inc. 
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Option valuation preliminaries 


OUTLINE 


e continuously compounded interest rate 
e short selling 


no arbitrage principle 


put—call parity 
e basic inequalities for option values 


2.1 Motivation 


There are certain simple results about option valuation that can be deduced from 
first principles, using elementary mathematics. This chapter derives such results. 
To do this we introduce two key concepts: discounting for interest and the no arbi- 
trage principle. The results that we derive do not require us to make any assump- 
tions about the behaviour of the underlying asset, nor do they use any probability 
theory. 


2.2 Interest rates 


Suppose we have some money in a risk-free savings account. If this investment 
grows according to a continuously compounded interest rate, r, then its value in- 
creases by a factor e” over a time length r. In other words, an amount Do at time 
zero is worth 


D(t) =e" Do (2.1) 


at time t. To be specific, we will use r to denote the annual rate, so that time is 
measured in years. Typical values of r lie between 0.01 and 0.1 (1% and 10% 
interest rates). It is not important in this book whether D (t) is measured in dollars, 
euros, or any other currency. 
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Throughout the book we make the standing assumption that a fixed interest rate 
r prevails whenever cash is lent or borrowed; the same rate r applies at all times 
and whatever amount of cash is involved. This assumption is, of course, only an 
approximation to reality — interest rates change over time and typically depend on 
the size of the investment. 

An immediate consequence of our interest rate assumption is that if somebody 
were to make you the offer of 


(a) $100 immediately (time t = 0), or 
(b) $100e” at time t 


then you would regard both offers as being of equal value. (In case (a) you could 
invest the money to obtain $100e”’ at time t. In case (b) you could borrow $100 
immediately and repay the loan at time t.) 

Similarly, a deal that is guaranteed to produce exactly $100 at time f is worth 
exactly $100e—”’ at time zero. Transferring from $100 to $100e~"" in this way is 
called discounting for interest or discounting for inflation, and it is a concept that 
we will use frequently. 


2.3 Short selling 


We use the term portfolio to describe a combination of 


(i) assets, 
(ii) options, and 
(iii) cash (invested in a bank). 


Moreover, we will assume that it is possible to hold negative amounts of each. A 
negative amount of cash has the obvious interpretation that cash has been borrowed 
rather than invested in the bank. Owning a negative amount of asset or option 
might not seem so reasonable. However, in many cases this is possible through the 
practice of short selling, which means selling an item that is not owned with the 
intention of buying it back at a later date. In practice, to short sell an item you must 
first borrow it from somebody who owns it, and give it back later. We will assume 
that this is always possible, at no cost, and that the short seller is free to choose 
when to buy back and return the item. 

To illustrate the idea, let S(t) denote the value of an asset at time t. If we short 
sell an asset at time tı and buy it back at time fo, then we have 


(a) gained an amount S(t) at time t = tı from the short sale, 
(b) paid out an amount S (t2) at time t = t2 from the buy back. 
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Having invested the initial gain, the overall profit/loss at time t = fp is therefore 
e O=) S(t) — S(t). 


2.4 Arbitrage 


One of the key principles on which option valuation theory rests is no arbitrage. 
This may be summarized as follows. 

There is never an opportunity to make a risk-free profit that gives a greater return than that 
provided by the interest from a bank deposit. 

Note that this assumption applies only to risk-free profit, it is not relevant 
to portfolios that ‘have a good chance’ of making a greater return than a bank 
deposit. 

To justify the no arbitrage assumption, suppose it were possible to put together 
a portfolio that gave a guaranteed improvement on the bank’s interest rate. Sen- 
sible investors would simply borrow money from the bank and spend it on the 
portfolio, thereby locking in to a guaranteed risk-free profit. The forces of supply 
and demand would then cause the yield from the portfolio to drop, or the inter- 
est rate to increase, or both, until parity was restored. Further justification for this 
assumption is provided by the existence of arbitrageurs who scour the markets 
seeking to exploit any opportunities for risk-free profits beyond the interest rate 
level. 


2.5 Put—call parity 


There is a delightfully simple argument that defines a relationship between the 
value C of a European call option and the value P of a European put option, with 
the same strike price E and expiry date T. (In this section and the next, value is 
taken by default to mean value at time t = 0.) Consider two portfolios 

m4: one call option plus Ee~’! cash (invested in a bank), 
xg: one put option plus one unit of the asset. 


At the expiry date, the portfolio 7,4 is worth max(S(T) — E,0)+ E, which is 
max(S(7'), E). The portfolio zg is worth on expiry max(E — S(T),0)+ S(T), 
which also reduces to max(S (T), E). Common sense dictates that since the two 
portfolios always give the same payoff, they must have the same value at time zero, 
so 


C+Ee™ =P +5. (2.2) 


This relationship, which connects the values of the call and put, is called put—call 
parity. Note that (2.2) was derived without any assumptions about the behaviour 
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of the asset. Because of put—call parity, if we can work out a procedure for valuing 
a European call option, we automatically get a procedure for valuing a European 
put option, and vice versa. 

The argument behind (2.2) can be made more precise via the no arbitrage prin- 
ciple. If z4 were worth more than srg at time 0 then it would be possible to sell 24 
(that is, sell the call option and borrow the cash) and buy srg (that is, buy one put 
option and one share). This brings us an instantaneous profit of 74 — mg (since 
we are sure that the payoff from zg exactly compensates for that of 7 4 at expiry). 
Such instantaneous profit clearly violates the no arbitrage principle. A similar ar- 
gument applies if zg is worth more than zr, at time zero. 


2.6 Upper and lower bounds on option values 


Similar arguments to those above can be used to obtain simple upper and lower 
bounds on the values C and P of European call and put options. 
To study the call option, consider two portfolios: 


m4: one call option plus Ee"? 


Tg: one unit of asset. 


cash (invested in a bank), 


We saw above that 7 4 has payoff max(S (T), E) at expiry. The portfolio Tg has 
payoff S(T), which is never greater than the payoff for 24. Common sense (or, 
more formally, the no arbitrage assumption — Exercise 2.3) dictates that 7g must 
therefore have a time-zero value that is no greater than that for 74. This means 
§<C+Ee!, or 


C> S-E. (2.3) 
Since the call option cannot have a negative value, we may strengthen this to 
C > max(S — Ee™7 0). (2.4) 


On the other hand, since the strike price E is always > 0, the call option can never 
be worth more than the underlying asset, so 


C <S. (2.5) 


Figure 2.1 illustrates the bounds (2.4) and (2.5). 
The corresponding upper and lower bounds for P, 


P > max(Ee~"! — S, 0) and P < Ee", (2.6) 


can be derived either by a similar argument, or via (2.4)—(2.5) and put—call parity 
(2.2), see Exercise 2.4. 


2.6 Upper and lower bounds on option values 15 
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Fig. 2.1. Upper and lower bounds (2.4) and (2.5) for European call option. Here, 
the x-axis is S, the asset price at time zero, and the y-axis is C, option value at 
time zero. Option value C must lie in the shaded region. 


A final result that we can prove from first principles is as follows. 


The time-zero European call option value, C, is nondecreasing as a function of the expiry 
date T. 


To see this, consider European call options with expiry dates Tı and T2, with 
Tə > T,, having the same strike price, Æ. We will show that the holder of the T2 
option can guarantee to get a payoff at least as big as e” 27T) max(S(T|) — E, 0). 
Suppose the T) option holder takes the following action at time t = T}. 


Case 1: If S(T1) < E do nothing. (The T; option has zero payoff, so the T2 option payoff 
will be no worse.) 

Case 2: If S(T1) > E then short sell one unit of the asset at time t = 71, invest the money, 
and buy back the asset at t = T2. (Intuitively, the Tı option produces a positive 
payoff. In order to match it, the T) holder guards against future decrease in the 
asset price by taking out an investment that gains if the asset falls.) 


In Case 1 it is trivially true that the T option holder makes an overall profit of 
at least e” 2-11) max(S (T1) — E, 0) = 0. In Case 2, the Tz option holder has a 
payoff at time t = T2 made up of 


(a) max(S (T2) — E, 0) from the original T> option, plus 

(b) e” T271 §(T;) from investing the proceeds of short selling the asset at time t = T}, 
plus 

(c) —S (T2) from covering the short sale. 
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The overall payoff at t = Tə is thus 
max(S(T2) — E,0) + e” P7 §(T,) — S(T) 
= max (OM scr) — E, e T-TÒS (T1) — ST») 
e P-TS(T1) — E 
gS =E) 
= e" T271) max(S (T1) — E, 0). 


IV 


IV 


(The last line follows because we are in Case 2.) 

We have shown that there is a strategy whereby, after discounting for interest, 
the T2 option holder can guarantee to have a payoff at least as great as that of the 
Tı option holder. Hence, by the no arbitrage principle, the T2 option must have a 
value at least as great as that of the Tı option. 

It is perhaps surprising that there is no such simple result for European put 
options; see Exercise 10.7 in Chapter 10. 

This chapter has given an indication of some simple results that can be derived 
from first principles. To proceed further we need to make assumptions about the 
behaviour of the underlying asset, which leads us immediately into the realms of 
probability and random variables. 


2.7 Notes and references 


Further details about arbitraging and short selling can be found, for example, in 
(Hull, 2000). 


EXERCISES 


2.1. x Compound interest works as follows. An investment Dg at time zero when 
compounded m times up to time f at rate re becomes worth 


ret \" 
D(t) = (1 į 2) Do. 
m 


Show that, for a given m, the compound interest rate ro that produces the 
same amount as the continuously compounded value e”' Do satisfies re = 
m(e"/™ — 1)/t. Use the approximation e* ~ 1 + x for small x to show that 
re ~r when m is large. (Note that in this book we always work with contin- 
uously compounded interest.) 

2.2. xx The continuously compounded interest rate formula can be derived by 


(a) splitting the time interval [0, t] into subintervals [0, ôt], [6t, 25r],..., [(L — 1) 
ôt, Lêt], where ôt = t/L, and 


2.3. 
2.4. 


2.5. 


2.6. 
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(b) assuming that the value of the investment increases by a relative amount pro- 
portional to rôt over each subinterval. 


Letting t; = iôt, this means 

D(ti41) = (1 +rôt)D (ti), (2.7) 
and hence 

D(t = tL) = (1 + rôt)} Do. 


By writing this as D(t) = et 8"+"'/4) Do and using log(1 + €) = € + 
O (€?) as € — 0, show that this model reproduces the formula (2.1) in the 
limit L > œ (i.e. ôt —> 0). Show that the models 


D(ti41) = (1 F rV/5t) D(ti) (2.8) 
and 
Dti) = (1 +r 60?) D0) (2.9) 


are not consistent with continuous compounding in the limit L — oo. 

xx Give an argument based on the no arbitrage assumption that justifies (2.3). 
xx Establish (2.6) (a) by setting up suitable portfolios and applying the argu- 
ments used to get (2.4)-(2.5), and (b) separately, by using (2.4)-(2.5) plus 
put—call parity (2.2). 

x x x Show that a butterfly spread with exactly the same payoff as that in 
Exercise 1.3 can be obtained using only a combination of European put op- 
tions. Use put—call parity (2.2) to confirm that the two spreads have the same 
set-up cost. 

xxx A forward contract, which is similar to a futures contract, operates 
as follows. Now, at time t = 0, Party A agrees to purchase an asset from 
Party B at a specified delivery time t = T for a specified price F. (Note that 
Party A is committed to the future purchase — by contrast, with a European 
call option the holder has the right, but not the obligation, to buy at the 
prescribed price.) Appealing to the no arbitrage assumption, show that a fair 
value for F is S(O)e"!. 


2.8 Program of Chapter 2 and walkthrough 


The program ch02, which illustrates the connection between compound and continuous interest cov- 


ered in Exercise 2.1, makes use of MATLAB’s for loop construction. It is listed in Figure 2.2. After 


clearing the figure window and initializing dzero, r, T and m, we use a for loop to do the main com- 


putation. The syntax for i = 1:m...end causes the enclosed statements to be executed m times, 


first with i=1, then with i=2, and so on up to i=m. It follows from Exercise 2.1 that d(i) is the value 
of the investment after i months, corresponding to time tval (i). 
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%CHO2 Program for Chapter 2 
% 
% Illustrates compound interest 


clf 

dzero = 5; 

r = 0.15; % Compound interest rate 
T=5; % 5 year period 

m = 60; % 60 months 

fori = 1:m % let i months elapse 


tval(i) = i/12; % time in years 
d(i) = dzero*(1+r*tval(i)/i)i; % compound i times 
end 


trange = [0,tval]; 
drange = [dzero,d]; 


plot(trange,drange,’r*’) 

hold on 

tects = linspace(0,T,100); 

dcts = dzero*exp(r*tcts); 

plot(tcts,dcts,’b-’) 

grid on 

xlabel(’t’) 

ylabel("D(t)’) 

title(‘Compound versus Continuous Interest’) 
legend(’Compound’,’Continuous’,2) 


Fig. 2.2. Program of Chapter 2: chO2.m. 


The line trange = [0,tval] creates a new one-dimensional array whose first entry is 0, second 
entry is tval (1), third entry is tval (2), etc. Similarly drange has entries dzero, d(1), d(2),..., 
d(m). This is done so that the initial values are included in the plot. The legend function produces 
the annotation seen at the top right-hand corner of the picture. The remainder of the program uses 
commands explained in Chapter 1. The program produces the picture shown in Figure 2.3. 


PROGRAMMING EXERCISES 
P2.1. Adapt ch02 to show that the models (2.8) and (2.9) are not sensible. 


P2.2. Use the command fill to create a picture similar to Figure 2.1. 


Quotes 


The principle of no-arbitrage pricing is obvious, 
but its application leads to many subtle and unanticipated pricing relationships. 
JOHN H. COCHRANE (Cochrane, 2001) 
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Compound versus Continuous Interest 


» Compound 
— Continuous 


D(t) 


Fig. 2.3. Figure produced by ch02. 


In need of a euphemism for what we did with other people’s money, 
we called it ‘arbitrage’, 
which was just plain obfuscation. 
MICHAEL LEWIS (Lewis, 1989) 


In practical terms, those who go short sell a security they have borrowed. 
They must return the security later — by which time, they believe, 
the price will have declined. 
The principle of buying cheap and selling dear still holds. 
Short sellers merely reverse the order: sell dear, then buy cheap. 
ROGER LOWENSTEIN (Lowenstein, 2001) 


Others are contemplating a bet known as the O’Hare straddle. 
You max out your credit line, 
make a stab at guessing tomorrow’s opening prices, 
and flee to O’Hare Airport. 
You wake up tomorrow in Rio, either a bankrupt fugitive 
or a lucky millionaire. 
THOMAS A. BASS (Bass, 1999) 
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Random variables 


OUTLINE 


e discrete and continuous random variables 
e expected value and variance 

e uniform and normal distributions 

e Central Limit Theorem 


3.1 Motivation 


The mathematical ideas that we develop in this book are going to involve random 
variables. In this chapter we give a very brief introduction to the main ideas that 
are needed. If this material is completely new to you, then you may need to refer 
back to this chapter as you progress through the book. 


3.2 Random variables, probability and mean 


If we roll a fair dice, each of the six possible outcomes 1, 2, ..., 6 is equally likely. 
So we say that each outcome has probability 1/6. We can generalize this idea to the 
case of a discrete random variable X that takes values from a finite set of numbers 
{x1,X2,...,Xm}. Associated with the random variable X are a set of probabilities 
{P1, P2, ---, Pm} such that x; occurs with probability p;. We write P(X = x;) to 
mean ‘the probability that X = x;’. For this to make sense we require 


e pi> 0, foralli (negative probabilities not allowed), 
e5 pi=l (probabilities add up to 1). 


The mean, or expected value, of a discrete random variable X, denoted by E(X), 
is defined by 


E= S p: (3.1) 


i=l 


21 


22 Random variables 


Note that for the dice example above we have 


iO) =e a ee 
~ 6 6 6 a 
which is intuitively reasonable. 


Example A random variable X that takes the value 1 with probability p (where 
0 < p < 1) and takes the value 0 with probability 1 — p is called a Bernoulli 
random variable with parameter p. Here, m = 2, xı = 1, x2 = 0, pı = p and 


p2 = 1 — p, in the notation above. For such a random variable we have 


E(X) = 1p +0(1 — p) =p. (3.2) 
< 


A continuous random variable may take any value in R. In this book, continuous 
random variables are characterized by their density functions. If X is a continuous 
random variable then we assume that there is a real-valued density function f 
such that the probability ofa < X < b is found by integrating f (x) from x = a to 
x = b; that is, 


b 
P(a < X < b) = J f (x)dx. (3.3) 


Here, P(a < X < b) means ‘the probability that a < X < b’. For this to make 
sense we require 


e f(x) =>0, forall x (negative probabilities not allowed), 
° a f(x)dx=1 (density integrates to 1). 


The mean, or expected value, of a continuous random variable X, denoted E(X), 
is defined by 


E(X) := f. xf (x)dx. (3.4) 


Note that in some cases this infinite integral does not exist. In this book, whenever 
we write E we are implicitly assuming that the integral exists. 


Example A random variable X with density function 


(B —a)!, fora <x < B, 
0 otherwise, 


fœ) = | (3.5) 
is said to have a uniform distribution over (a, p). We write X ~ U(a, B). 
Loosely, X only takes values between œ and 6 and is equally likely to take any 
such value. More precisely, given values xı and x2 with a < xı < x2 < B, the 
probability that X takes a value in the interval [x1, x2] is given by the relative 


3.3 Independence 23 


size of the interval: (x2 — x,)/($ — æ). Exercise 3.1 asks you to confirm this. If 
X ~ U(«, P) then X has mean given by 


x oa 1 B 1 x? p a+ fp 
Xx) =f xtords= 5 | sem 5" |3] = 7 A 


Generally, if X and Y are random variables, then we may create new random 
variables by combining them. So, for example, X + Y, X? + sin(Y) and eV X+Y 
are also random variables. 

Two fundamental identities that apply for any random variables X and Y are 


© 


E(X +Y) =E(X) +E), (3.6) 
E(aX) = aE(X), fora e R. (3.7) 


In words: the mean of the sum is the sum of the means, and the mean scales lin- 
early. The following result will also prove to be very useful. If we apply a function 
h to a continuous random variable X then the mean of the random variable A(X) 
is given by 


E(h(X)) =| h(x) f (x)dx. (3.8) 


3.3 Independence 


If we say that the two random variables X and Y are independent, then this has 
an intuitively reasonable interpretation — the value taken by X does not depend on 
the value taken by Y, and vice versa. To state the classical, formal definition of 
independence requires more background theory than we have given here, but an 
equivalent condition is 


E(9(X)h(Y)) = E(e(X))E(A(Y)), —forallg,h: ROR. 


In particular, taking g and A to be the identity function, we have 


X and Y independent > E(XY) = E(X)E(Y). (3.9) 


Note that E(XY) = E(X)E(’) does not hold, in general, when X and Y are 
not independent. For example, taking X as in Exercise 3.4 and Y = X we have 
W(X?) #& EX). 

We will sometimes encounter sequences of random variables that are indepen- 
dent and identically distributed, abbreviated to i.i.d. Saying that X1, X2, X3,... 
are i.i.d. means that 
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(i) in the discrete case the X; have the same possible values {x1, x2, ..., Xm} and prob- 
abilities {p1, p2,..., Pm}, and in the continuous case the X; have the same density 
function f(x), and 

(ii) being told the values of any subset of the X;s tells us nothing about the values of the 
remaining X;s. 


In particular, if X1, X2, X3,... are iid. then they are pairwise independent and 
hence E(X;X ;) = E(X;)E(X ;), fori Æ j. 


3.4 Variance 


Having defined the mean of discrete and continuous random variables in (3.1) and 
(3.4), we may define the variance as 


var(X) := E(X — E(X)”. (3.10) 


Loosely, the mean tells you the ‘typical’ or ‘average’ value and the variance gives 
you the amount of ‘variation’ around this value. 
The variance has the equivalent definition 


var(X) := E(X’) — (E(X))’; (3.11) 


see Exercise 3.3. That exercise also asks you to confirm the scaling property 
var(aX) = a’var(X), fora € R. (3.12) 


The standard deviation, which we denote by std, is simply the square root of 
the variance; that is 


std(X) := yvar(X). (3.13) 


Example Suppose X is a Bernoulli random variable with parameter p, as in- 
troduced above. Then (X — E(X))? takes the value (1 — py with probability p 
and p* with probability 1 — p. Hence, using (3.10), 


var(X) = E(X — E(X))*) = (1 — př p + pP A- p) =p- p. (3.14) 


It follows that taking p = 5 maximizes the variance. © 


Example For X ~ U(a, 8) we have E(X?) = (œ? + af + B7)/3 and hence, 
from (3.11), var(X) = (£ — a)*/12, see Exercise 3.5. So, if Y} ~ U(—1, 1) and 
Y2 ~ U(-2, 2), then Yı and Yz have the same mean, but Yz has a bigger variance, 
as we would expect. ° 
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mE N(0,1) density 


0.45} | | 

0.44 

0.35} 

0.3} 

f(x) 0.25} 
0.2} 

0.15} 

0.1} 


0.05; 


0 


Fig. 3.1. Density function (3.15) for an N(O, 1) random variable. 


3.5 Normal distribution 


One particular type of random variable turns out to be by far the most important for 
our purposes (and indeed for most purposes). If X is a continuous random variable 
with density function 


te (3.15) 
——e 2, .l 
v27 
then we say that X has the standard normal distribution and we write X ~ N(O, 1). 
Here N stands for normal, 0 is the mean and 1 is the variance; so for this X we 
have E(X) = 0 and var(X) = 1, see Exercise 3.7. Plotting the density f in (3.15) 
reveals the familiar bell-shaped curve; see Figure 3.1. 

More generally, a N(u, o?) random variable, which is characterized by the den- 
sity function 


fœ) = 


Qw)? 


se T (3.16) 


fœ) = 


210 


has mean u and variance o7; see Exercise 3.8. Figure 3.2 plots density functions 


for various u and o. The curves are symmetric about x = u. Increasing the vari- 


ance g? causes the density to flatten out — making extreme values more likely. 
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u=0,0=1 u=4,0=1 
0.5 0.5 
0.4 0.4 
0.3 0.3 
f(x) f(x) 
0.2 0.2 
0.1 0.1 


m= -1,0=3 

0.5 

0.4 0.4 
0.3 0.3 

f(x) f(x) 
0.2 0.2 
0.1 í È 0.1 
-10 -5 0 5 10 -10 -5 0 5 10 
x x 


Fig. 3.2. Density functions for various N(x, 0”) random variables. 


Given a density function f(x) for a continuous random variable X, we may 
define the distribution function F (x) := P(X < x), or, equivalently, 


F(x) := f f(s)ds. (3.17) 


In words, F (x) is the area under the density curve to the left of x. The distribution 
function for a standard normal random variable turns out to play a central role in 
this book, so we will denote it by N (x): 


x s2 
N(x) := = He e 2 ds. (3.18) 


Figure 3.3 gives a plot of N(x). 
Some useful properties of normal random variables are: 


(i) If X ~ N(w, o?) then (X — w)/o ~ N(O, 1). 
(ii) If Y ~ N(O, 1) then oY + u ~ N(u, o”). 
(iii) If Xı ~ N(u1, 07), X2 ~ N(u2, 05) and X; and X2 are independent, then X1 + 
Xa ~ Ne + m2, o? + o2). 
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F(x) 
0.5 m 


—4 -2 0 2 4 


Fig. 3.3. Upper picture: N(0, 1) density. Lower picture: the distribution function 
N (x) — for each x this is the area of the shaded region in the upper picture. 


3.6 Central Limit Theorem 


A fundamental, beautiful and far-reaching result in probability theory says that the 
sum of a large number of i.i.d. random variables will be approximately normal. 
This is the Central Limit Theorem. To be more precise, let X1, X2, X3,... be a 
sequence of i.i.d. random variables, each with mean jz and variance o?, and let 


Sn = Yxi. 


i=l 


The Central Limit Theorem says that for large n, Sn behaves like an N(n u, no”) 
random variable. More precisely, (Sn — nu)/(o s/n) is approximately N(O, 1) in 
the sense that for any x we have 


P (S= < x) —> N(x), asn — OO. (3.19) 
oyn 

The result (3.19) involves convergence in distribution. It says that the distribu- 

tion function for (S — nu)/(o s/n) converges pointwise to N (x). There are many 

other, distinct senses in which a sequence of random variables may exhibit some 

sort of limiting behaviour, but none of them will be discussed in this book. So 

whenever we argue that a sequence of random variables is ‘close to some random 
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variable X’, we implicitly mean close in this distributional sense. We will be us- 
ing the Central Limit Theorem as a means to derive heuristically a number of 
stochastic expressions. Justifying these derivations rigorously would require us to 
introduce stronger concepts of convergence and set up some technical machinery. 
To keep the book as accessible as possible, we have chosen to avoid this route. 
Fortunately, the Central Limit Theorem does not lead us astray. 

An awareness of the Central Limit Theorem has led many scientists to make 
the following logical step: real-life systems are subject to a range of external in- 
fluences that can be reasonably approximated by i.i.d. random variables and hence 
the overall effect can be reasonably modelled by a single normal random vari- 
able with an appropriate mean and variance. This is why normal random variables 
are ubiquitous in stochastic modelling. With this in mind, it should come as no 
surprise that normal random variables will play a leading role when we tackle the 
problem of modelling assets and valuing financial options. 


3.7 Notes and references 


The purpose of this chapter was to equip you with the minimum amount of material 
on random variables and probability that is needed in the rest of the book. As such, 
it has left a vast amount unsaid. There are many good introductory books on the 
subject. A popular choice is (Grimmett and Welsh, 1986), which leads on to the 
more advanced text (Grimmett and Stirzaker, 2001). 

Lighter reading is provided by two highly accessible texts of a more informal 
nature, (Isaac, 1995) and (Nahin, 2000). 

A comprehensive, introductory text that may be freely downloaded from 
the WWW is (Grinstead and Snell, 1997). This book, and many other re- 
sources, can be found via The Probability Web at http://mathcs.carleton.edu/ 
probweb/probweb.html. 

To study probability with complete rigour requires the use of measure theory. 
Accessible routes into this area are offered by (Capinski and Kopp, 1999) and 
(Rosenthal, 2000). 


EXERCISES 
3.1. x Suppose X ~ U(a, B). Show that for an interval [x1, x2] in (œ, B) we have 
X2 — x] 
p-a 


3.2. xx Show that (3.7) holds for a discrete random variable. Now suppose that 


PQ, < X <x) = 


X is a continuous random variable with density function f. Recall that the 


3.3. 


3.4. 


3.5. 


3.6. 


3.7. 


3.8. 


3.9. 
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density function is characterized by (3.3). What is the density function of 
aX, for a € R? Show that (3.7) holds. 

xx Using (3.6) and (3.7) show that (3.10) and (3.11) are equivalent and 
establish (3.12). 

xx A continuous random variable X with density function 


ee: for x > 0, 
0, for x <0, 


roy ={ 


where à > 0, is said to have the exponential distribution with parameter i. 
Show that in this case E(X) = 1/2. Show also that E(X?) = 2/2 and hence 
find an expression for var(X). 

xx Show that if X ~ U(«, B) then E(X?) = (a? + aß + B7)/3 and hence 
var(X) = (B — a)?/12. 

x Let X and Y be independent random variables and let œ € R be a constant. 
Show that var(X + Y) = var(X) + var(Y) and var(a + X) = var(X). 

x x x Suppose that X ~ N(0, 1). Verify that E(X) = 0. From (3.8), the sec- 
ond moment of X , E(X?), satisfies 


E(X?) = 2e ldx, 


1 Co 
V2 J- 
Using integration by parts, show that E(X?) = 1, and hence that var(X) = 


1. From (3.8) again, for any integer p > 0 the pth moment of X, E(X?), 
satisfies 


et Ady, 


E(x?) = 


1 CO 
2g xP 
al: 
Show that E(X°) = 0 and E(X*) = 3, and find a general expression for 
(XP). (Note: you may use without proof the fact that eo ee qx = 
V27.) 

xx From the definition (3.16) of its density function, verify that an N(u, o?) 
random variable has mean u and variance o?. 
xx Show that N (x) in (3.18) satisfies N (œ) + N (—&) = 1. 


3.8 Program of Chapter 3 and walkthrough 


As an alternative to the four separate plots in Figure 3.2, ch03, listed in Figure 3.4, produces a three- 


dimensional plot of the N(0, o?) density function as o varies. The new commands introduced are 


meshgrid and waterfall. We look at ø values between 1 and 5 in steps of dsig = 0.25 and plot 


30 Random variables 


%CHO3 Program for Chapter 3 
% 
% Illustrates Normal distribution 


clf 

dsig = 0.25; 
dx = 0.5; 
mu = 0; 


[X,SIGMA] = meshgrid(-10:dx:10,1:dsig:5); 

Z = exp(-(X-mu).°2./(2*SIGMA.°2))./sqrt(2* pi* SIGMA.”2); 
waterfall(X,SIGMA,Z) 

xlabel(’x’) 

ylabel(\sigma’) 

zlabel(’f(x)’) 

title(’N(0,\sigma) density for various \sigma’) 


Fig. 3.4. Program of Chapter 3: ch03.m. 


the density function for x between —10 and 10 in steps of dx = 0.5. The line 
[X,SIGMA] = meshgrid(-10:dx:10,1:dsigma:5) 


sets up a pair of 17 by 41 two-dimensional arrays X, and SIGMA, that store the ø and x values in a 
format suitable for the three-dimensional plotting routines. The line 


Z = exp(-(X-mu) .72./(2*SIGMA.2)) ./sqrt (2*pi*SIGMA.~2) ; 


then computes values of the density function. Note that the powering operator, ^, and the division 


operator, /, are preceded by full stops. This notation allows MATLAB to work directly on arrays by 
interpreting the commands in a componentwise sense. A simple illustration of this effect is 


>> [1,2,3].*[5,6,7] 
>> ans = 5 12 21 


The waterfall function is then used to give a three-dimensional plot of Z by taking slices along the 
x-direction. The resulting picture is shown in Figure 3.5. 


PROGRAMMING EXERCISES 


P3.1. Experiment with ch03 by varying dx and dsigma, and replacing 
waterfall by mesh, surf and surfc. 

P3.2. Write an analogue of ch03 for the exponential density function defined in 
Exercise 3.4. 


Quotes 


Our intuition is not a viable substitute for the more formal theory of probability. 
MARK DENNEY AND STEVEN GAINES (Denney and Gaines, 2000) 
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N(0,o) density for various o 


Fig. 3.5. Graphics produced by ch03. 


Statistics: the mathematical theory of ignorance. 
MORRIS KLINE, source www.mathacademy.com/pr/quotes/ 


Stock prices have reached what looks like a permanently high plateau. 
(In a speech made nine days before 
the 1929 stock market crash.) 
IRVING FISHER, economist, source 
www.quotesforall.com/f/fisherirving.htm 


Norman has stumbled into the lair of a chartist, 
an occult tape reader who thinks he can predict market moves by eyeballing 
the shape that stock prices take when plotted on a piece of graph paper. 
Chartists are to finance what astrology is to space science. 
It is a mystical practice akin to reading the entrails of animals. 
But its newspaper of record is The Wall Street Journal, 
and almost every major financial institution in the United States 
keeps at least one or two chartists working behind closed doors. 
THOMAS A. BASS (Bass, 1999) 
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Computer simulation 


OUTLINE 


e random number generation 


sample mean and variance 
e kernel density estimation 


quantile—quantile plots 


4.1 Motivation 


The models that we develop for option valuation will involve randomness. One of 
the main thrusts of this book is the use of computer simulation to experiment with 
and visualize our ideas, and also to estimate quantities that cannot be determined 
analytically. This chapter introduces the tools that we will apply. 


4.2 Pseudo-random numbers 


Computers are deterministic — they do exactly what they are told and hence are 
completely predictable. This is generally a good thing, but it is at odds with the 
idea of generating random numbers. In practice, however, it is usually sufficient 
to work with pseudo-random numbers. These are collections of numbers that are 
produced by a deterministic algorithm and yet seem to be random in the sense 
that, en masse, they have appropriate statistical properties. Our approach here is to 
assume that we have access to black-box programs that generate large sequences 
of pseudo-random numbers. Hence, we completely ignore the fascinating issue of 
designing algorithms for generating pseudo-random numbers. Our justification for 
this omission is that random number generation is a highly advanced, active, re- 
search topic and it is unreasonable to expect non-experts to understand and imple- 
ment programs that compete with the state-of-the-art. Off-the-shelf is better than 
roll-your-own in this context, and by making use of existing technology we can 
more quickly progress to the topics that are central to this book. 
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Table 4.1. Ten pseudo-random 
numbers from a U(0, 1) and 
an N(O, 1) generator 


U(0, 1) NO, 1) 
0.3929 0.9085 
0.6398 —2.2207 
0.7245 —0.2391 
0.6953 0.0687 
0.9058 —2.0202 
0.9429 —0.3641 
0.6350 —0.0813 
0.1500 —1.9797 
0.4741 0.7882 


0.9663 0.7366 


Table 4.1 shows two sets of ten numbers. These were produced from high- 
quality pseudo-random number generators designed to produce U(0,1) and 
NO, 1) samples.! We see that the putative U(0, 1) samples appear to be liber- 
ally spread across the interval (0, 1) and the putative N(0, 1) samples seem to be 
clustered around zero, but, of course, this tells us very little. 


4.3 Statistical tests 


We may test a pseudo-random number generator by taking M samples {&; ee , and 
computing the sample mean 


1 M 
um = 77) $i (4.1) 


i=l 


and the sample variance 


1 M 
2 2 
— = ; 4.2 
om wai Le um) (4.2) 
The sample mean (4.1) is simply the arithmetic average of the sample values. The 
sample variance is a similar arithmetic average corresponding to the expected value 
in (3.10) that defines the variance. (You might regard it as more natural to take the 
sample variance as (1/M) pe (ep Se m)”; however, it can be argued that scaling 


1 all computational experiments in this book were produced in MATLAB, using the built-in functions rand and 
randn to generate U(0, 1) and N(O, 1) samples, respectively. To make the experiments reproducible, we set the 
random number generator seed to 100; that is, we used rand ( ‘state’ ,100) and randn(‘state’,100). 
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Table 4.2. Sample mean (4.1) and sample variance 
(4.2) using M samples from a U(O, 1) and an 
N(O, 1) pseudo-random number generator 


U(0, 1) N(, 1) 

M UM ou HM o 
10? 0.5229 0.0924 0.0758 1.0996 
105 0.4884 0.0845 0.0192 0.9558 
10+ 0.5009 0.0833 —0.0115 0.9859 
10° 0.5010 0.0840 0.0005 1.0030 


by M — 1 instead of M is better. This issue is addressed in Chapter 15.) Results for 
M= 107, 10°, 10* and 10° appear in Table 4.2. We see that as M increases, the 
U(0, 1) sample means and variances approach the true values J and b = 0.0833 
(recall Exercise 3.5) and the N(O, 1) sample means and variances approach the true 
values 0 and 1. 

A more enlightening approach to testing a random number generator is to divide 
the x-axis into subintervals, or bins, of length Ax and count how many samples 
lie in each subinterval. We take M samples and let N; denote the number of sam- 
ples in the bin [i Ax, (i + 1) Ax]. If we approximate the probability of X taking 
a value in the subinterval [7 Ax, (i + 1) Ax] by the relative frequency with which 
this happened among the samples, then we have 


Ni 
PG@Ax < X <@+1)Ax) ~ m (4.3) 


On the other hand, we know from (3.3) that, for a random variable X with density 
fœ, 


(i+1)Ax 
PGAx < X < (i +1)Ax) = J f(x)dx. (4.4) 
i Ax 
Letting x; denote the midpoint of the subinterval [i Ax, (i + 1)Ax] we may use 
the Riemann sum approximation 


(i+1)Ax 
J f(x)dx ~ Axf (xi). (4.5) 
i Ax 

(Here, we have approximated the area under a curve by the area of a suitable rect- 
angle — draw a picture to see this.) Using (4.3)-(4.5), we see that plotting 
N;/(M Ax) against x; should give an approximation to the density function values 
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Fig. 4.1. Kernel density estimate for a U(0, 1) generator, with increasing number 
of samples. Vertical axis is N;/(M Ax), for Ax = 0.05. 


J (x). This technique, and more sophisticated extensions, fit into the area of kernel 
density estimation. 


Computational example We compute a simple kernel density estimate for 
a U(O,1) generator, using intervals of width Ax = 0.05. Since f(x) is 
nonzero only for 0 < x < 1, we take i = 0, 1,2,..., 19. In Figure 4.1 we plot 
N;/(M Ax) against x; for the number of samples M = 107, 104, 10°, 10°. These 
points are plotted as diamonds joined by straight lines for clarity. We see that as 
M increases the plot gets closer to that of a U(O, 1) density. © 


Computational example In Figure 4.2 we perform a similar experiment with 
an N(0, 1) generator. Here, we took intervals in the region —4 < x < 4 and used 
bins of width Ax = 0.05. (Samples that were smaller than —4 were added to the 
first bin and samples that were larger than 4 were added to the last bin.) We used 
M = 10°, 104, 10°, 10°. The correct N(O, 1) density curve is superimposed in 
white. We see that the density estimate improves as M increases. © 


We now look at another technique for examining statistical aspects of data. For 
a given density function f (x) and a given O < p < 1 define the pth quantile of f 
as Z(p), where 


z(p) 
J f(x) dx = p. (4.6) 
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Fig. 4.2. Kernel density estimate for an N(0, 1) generator, with increasing num- 
ber of samples. Vertical axis is N;/(M Ax), for Ax = 0.05. 


Given a set of data points 1, £2, . . . , Em , a quantile—quantile plot is produced by 
(a) placing the data points in increasing order: & ; as pets SEU 


(b) plotting Er against z(k/(M + 1)). 


The idea of choosing quantiles for equally spaced p = k/(M + 1) is that it 
‘evens out’ the probability. Figure 4.3 illustrates the M = 9 case when f (x) is the 
N(O, 1) density. The upper picture emphasizes that the z(k/(M + 1)) break the 
x-axis into regions that give equal area under the density curve — that is, there is 
an equal chance of the random variable taking a value in each region. The lower 
picture in Figure 4.3 plots the function f(x) and shows that z(k/(M + 1)) are the 
points on the x-axis that correspond to equal increments along the y-axis. The idea 
is that, for large M, if the quantile—quantile plot produces points that lie approxi- 
mately on a straight line of unit slope, then we may conclude that the data points 
‘look as though’ they were drawn from a distribution corresponding to f(x). To 
justify this, if we divide the x-axis into M bins where x is in the kth bin if it is 
closest to z(k/(M + 1)), then, having evened out the probability, we would expect 
roughly one &; value in each bin. So the smallest data point, E, should be close 
to z(1/(M + 1)), the second smallest, ay should be close to z(2/(M + 1)), and 
so on. 


Computational example Figure 4.4 tests the quantile—quantile idea. Here we 
took M = 100 samples from N(0, 1) and U(O, 1) random number generators. 
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f(x) 


Fig. 4.3. Asterisks on the x-axis mark the quantiles z(k/(M + 1)) in (4.6) for an 
N(O, 1) distribution using M = 9. Upper picture: the quantiles break the x-axis 
into regions where f(x) has equal area. Lower picture: equivalently, the quantiles 
break the x-axis into regions where N (x) has equal increments. 


Each data set was plotted against the N(O, 1) and U(0, 1) quantiles. A reference 
line of unit slope is added to each plot. As expected, the data set matches well 
with the ‘correct’ quantiles and very poorly with the ‘incorrect’ quantiles. © 


Computational example In Figures 4.5 and 4.6 we use the techniques intro- 
duced above to show the remarkable power of the Central Limit Theorem. Here, 
we generated sets of U(0, 1) samples {&}/'_,, with n = 10°. These were com- 
bined to give samples of the form 


par §i — nu 
a/n : 
1 


where u = 5 and o? = b- We repeated this M = 104 times. These M data 
points were then used to obtain a kernel density estimate. In Figure 4.5 we 
used bins of width Ax = 0.5 over [—4, 4] and plotted N;/(M Ax) against x;, 
as described for Figure 4.1. Here we have used a histogram, or bar graph, so 
each rectangle is centred at an x; and has height N;/(M Ax). The N(O, 1) den- 
sity curve is superimposed as a dashed line. Figure 4.6 gives the corresponding 
quantile—quantile plot. The figures confirm that even though each é; is nothing 


like normal, the scaled sum ()77_, & — nw)/(o/n) is very close to N(0, 1). > 


(4.7) 
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Fig. 4.4, Quantile—quantile plots using M = 100 samples. Ordered samples i, 
&,...,&y on the x-axis against quantiles z(k/(M + 1)) on the y-axis. Pictures 
show the four possible combinations arising from N(O, 1) or U(0, 1) random num- 
ber samples against N(0, 1) or U(O, 1) quantiles. 
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Fig. 4.5. Kernel density estimate for samples of the form (4.7), with N(O, 1) 
density superimposed. 


40 Computer simulation 


Fig. 4.6. Quantile—quantile plot for samples of the form (4.7) against N(O, 1) 
quantiles. 


4.4 Notes and references 


Much more about the theory and practice of designing and implementing computer 
simulation experiments can be found in (Morgan, 2000) and (Ripley, 1987). In 
particular, those references mention rules of thumb for choosing the bin width as a 
function of the sample size in kernel density estimation. 

The pLab website, which lives at http://random.mat.sbg.ac.at/, gives informa- 
tion on random number generation, and has links to free software in a variety of 
computing languages. 

A very readable essay on pseudo-random number generation can be found in 
(Nahin, 2000). That book also contains some wonderful probability-based prob- 
lems, with accompanying MATLAB programs. 

Cleve’s Corner articles “Normal behavior’, spring 2001, and ‘Random 
Thoughts’, fall 1995, which are downloadable from www.mathworks.com/ 
company/newsletter/clevescorner/cleve-toc.shtml are informative musings on 
MATLAB’s pseudo-random number generators. 

As an alternative to ‘pseudo-’, it is possible to buy ‘true’ random numbers that 
are generated from physical devices. For example, one approach is to record decay 
times from a radioactive material. The readable article “Hardware random number 
generators’, by Robert Davies, can be found at www.robertnz.net/hwrng.htm. 
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EXERCISES 


xx Some scientific computing packages offer a black-box routine to evaluate 
the error function, erf, defined by 


erf(x) := = Í T dt. (4.8) 


Show that the N(0, 1) distribution function N (x) in (3.18) can be evaluated 
as 


1+erf (x /~v2) 


N(x) = = (4.9) 


xx Show that samples from the exponential distribution with parameter À, as 
described in Exercise 3.4, may be generated as —(log(&;))/4, where the {&;} 
are U(0, 1) samples. 

xx Show that the quantile z(p) in (4.6) for the N(O, 1) distribution function 
N (x) can be written as 


z(p)= V2 erfinv(2p — 1). 


Here, erfinv is the inverse error function; so erfinv(x) = y means erf(y) = x, 
where erf is defined in (4.8). 

xx In the case where f(x) is the density for the exponential distribution with 
parameter A = 1, as described in Exercise 3.4, show that the quantile z (p) 
in (4.6) satisfies z(p) = —log(1 — p). 


4.5 Program of Chapter 4 and walkthrough 


In ch04, listed in Figure 4.7, we repeat the type of computation that produced Figure 4.5. Here, we 


use samples é; in (4.7) that are the exponential of the square root of U(0, 1) samples. It follows from 
Exercise 21.2 that we should take u = 2 and o = y (e2 — 7)/2. 

The line colormap([0.5 0.5 0.5]) sets the greyscale for the histogram; [0 0 0] is black and 
[1 1 1] is white. We then use rand(’state’ , 100) to set the seed for the uniform pseudo-random 


number generator, as described in the footnote of Section 4.2. After specifying n, M, mu and sigma, 


and initializing S to an array of zeros, we perform the main task in a single for loop. The command 


rand(n, 1) creates an array of n values from the U(0, 1) pseudo-random number generator. We then 


apply sqrt to take the square root of each entry, exp to exponentiate and sum to add up the result. In 


other words 


sum(exp(sqrt (rand(n,1)))) 


corresponds to a sample of 
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%CHO04 Program for Chapter 4 
% 
% Histogram illustration of Central Limit Theorem 


clf 
colormap([0.5 0.5 0.5]) 
rand(‘state’,100) 


n= 5e+2; 
M = 1e+4; 
mu = 2; 


sigma = sqrt(0.5*(exp(2)-7)); 


S = zeros(M,1); 
for k = 1:M 

S(k) = (sum(exp(sqrt(rand(n,1)))) - n*mu)/(sigma*sqrt(n)); 
end 


AAPA% Histogram %%%%%%%%%%% APAPA 
dx = 0.5; 

centers = [-4:dx:4]; 

N = hist(S,centers); 

bar(centers,N/(M*dx)) 


hold on 

x = linspace(-4,4,100); 

y = exp(-0.5*x.°2)/sqrt(2* pi); 
plot(x,y,’r—,’Linewidth’,4) 
legend(’N(0,1) density’,/Sample data’) 
grid on 


Fig. 4.7. Program of Chapter 4: ch04.m. 


so overall, S(k) stores a sample of 


Shae -nu 
oxn ` 


The line N = hist(S,centers) ; creates a one-dimensional array N, whose ith entry records the 
number of values in S lying in the ith bin. Here, a point is mapped to the ith bin if its closest value 
in centers is centers(i). The command bar (centers ,N/(M*dx)) then draws a bar graph, or 
histogram, using this information. (We scale by M*dx so that the area of the histograph adds up to 
one.) 

Because we issued the command hold on, the second plot, a dashed line for the exact density 
curve, adds to, rather than replaces, the first. 
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PROGRAMMING EXERCISES 


P4.1. Adapt ch04.m to the case where é; in (4.7) are from the exponential distri- 
bution with parameter à = 1. [Hint: make use of Exercise 3.4 and Exercise 4.2.] 


P4.2. Adapt ch04.m so that it produces a quantile—quantile plot, as in Figure 4.6. 
(Note that the program of Chapter 5 shows how such a plot may be generated.) 


Quotes 


In 1955, before computers were so common, 
the RAND Corporation published a book entitled A Million Random Digits. 
It was used in selecting random trials for experimental designs and simulations 
(and perhaps as bedtime reading for insomniacs?). 
It was soon realized, however, that if everyone always started on page one, 
then all trials and simulations by all the book’s users would depend upon the quirks of 
the same random sequence. 
This generated much debate 
on how to select a random starting point in the table of random numbers. 
MICHAEL T. HEATH (Heath, 2002) 


The first thing needed for a stochastic simulation is a source of randomness. 
This is often taken for granted but is of fundamental importance. 
Regrettably many of the so-called random functions supplied with the most 
widespread computers 
are far from random, 
and many simulation studies have been invalidated as a consequence. 
BRIAN D. RIPLEY (Ripley, 1997) 


Here is an interesting number: 
0.950 129 285 147 18. 
This is the first number produced by the MATLAB random number generator with its 
default settings. 
Start up a fresh MATLAB, set format long, type rand, 
and it’s the number you get. 
If all MATLAB users, all around the world, on all different computers, 
keep getting this same number, is it really ‘random’? 
No, it isn’t. 
Computers are (in principle) deterministic machines 
and should not exhibit random behavior. 
If your computer doesn’t access some external device, 
like a gamma ray counter or a clock, 
then it must really be computing pseudorandom numbers. 
CLEVE B. MOLER AND KATHRYN A. MOLER, in Numerical Computing with 
MATLAB, 
see www.mathworks.com/moler/ 


5 


Asset price movement 


OUTLINE 


e efficient market hypothesis 
e examples of real asset data 
e tests for i.i.d. and normality 
e assumptions for the model 


5.1 Motivation 


In order to value an option, we must develop a mathematical description of how 
the underlying asset behaves. This chapter gives examples of real stock market 
data and performs some basic statistical tests. The tests pave the way for the math- 
ematical description that we introduce in the next chapter, but are definitely not 
intended to form an exhaustive justification of the model. We begin with an out- 
line of a key hypothesis, and finish by listing some of the assumptions that will go 
into our analysis. 


5.2 Efficient market hypothesis 


The price of an asset is, of course, a measure of investors’ confidence, and, as 
such, is strongly dependent upon news, rumours, speculation, and so on. Although 
an oversimplification, it is reasonable to assume that the market responds instanta- 
neously to external influences, and hence: 


the current asset price reflects all past information. 


This simple conclusion is known as the (weak form of the) efficient market hy- 
pothesis. Under this hypothesis, if we want to predict the asset price at some future 
time, knowing the complete history of the asset price gives no advantage over 
just knowing its current price — there is no edge to be gained from ‘reading the 
charts.’ 
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Fig. 5.1. Daily IBM share price from January to September 2001. 


From a modelling point of view, if we take on board the efficient market hypoth- 
esis, then an equation to describe the evolution of the asset from time ¢ to t + At 
need involve the asset price only at time ¢ and not at any earlier times. 


5.3 Asset price data 


In Figure 5.1 we plot the daily IBM share prices from January to the end of 
September 2001. These are the close-of-trading prices; that is, the price at the last 
transaction made in each trading day. In the traditional manner, we have ‘joined 
the dots’ so that successive data points are linked by straight lines. Figure 5.2 
gives the corresponding weekly IBM share prices from January 1998 to December 
2001. There are 184 data points in Figure 5.1 and 209 in Figure 5.2. Although cov- 
ering different timescales, both pictures display the same qualitative ‘jaggedness’. 
This type of up/down uncertainty is familiar to anybody who has seen stock market 
data displayed in graphical form. 

To examine this data, it is reasonable to treat it on the same level as the output 
from a pseudo-random number generator and test whether it has any statistical 
properties. In Figure 5.3 we give the results of such a test. The upper pictures 
involve the daily returns, 

„dily S(i+1) — S (ti) 
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Fig. 5.3. Statistical tests of IBM share price data. Upper: daily. Middle: weekly. 


Lower: N(0, 1) samples for comparison. 
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where S(t;) and S(t;41) are the asset prices on successive days, as used in 
Figure 5.1. These daily returns were normalized to 
p daily = aie =H 
l o z 
where u and ø? are the computed sample mean and sample variance, defined 
in (4.1) and (4.2), respectively. If the daily return data looks like i.i.d. sam- 
ples from a normal distribution, then Po will look like i.i.d. N(O, 1) samples. 
The upper left picture in Figure 5.3 gives a kernel density estimate for the oad 
data in the form of a histogram, with the N(0, 1) density curve (3.15) superim- 
posed as a dashed line. To estimate the corresponding distribution function, we 
may use a cumulative sum histogram, where in each bin we record the proportion 
of samples that fall in that bin, or in a bin to the left. This produces the histogram 
in the middle picture. The N(O, 1) distribution function (3.18) is superimposed as 
a dashed line. Finally, in the upper right picture we give a quantile—quantile plot, 
as described in Chapter 4, using N(0, 1) quantiles. The three middle pictures in 
Figure 5.3 present the same results for the normalized weekly returns, using the 
data from Figure 5.2. As a basis for comparison, the lower pictures give the output 
that arises when 200 points from an N(0, 1) pseudo-random number generator are 
subjected to the same scrutiny. 

Overall, Figure 5.3 suggests that the daily and weekly asset returns behave in a 
similar manner to normally distributed i.i.d. samples. The quantile—quantile plots, 
which are the most revealing, possibly indicate that the match is least accurate 
at the extremes of the range — this fat tail behaviour will be mentioned again in 
Section 7.4. 

As a final point, we remark that since the daily and weekly returns are quite 
small, the approximation log(1 + x) ~ x gives 


ibe (Se?) ibe (1 P S(ti+1) — =) n Siti) = SG) 6.1) 


S(ti) S(t) S(t) 


and hence we would see essentially the same pictures as those in Figure 5.3 if we 
replaced the returns with the log ratios, log (S(tj+1)/S(&)). 


5.4 Assumptions 


In the next chapter we develop a mathematical description of the asset price move- 
ment that is intended to capture the broad features that are observed in practice. 
Before we do that, we take the opportunity to list some of the assumptions that will 
be made in the subsequent analysis. 
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e The asset price may take any non-negative value. 

e Buying and selling an asset may take place at any time 0 < t < T. 

e It is possible to buy and sell any amount of the asset. 

e The bid—ask spread is zero — the price for buying equals the price for selling. 

e There are no transaction costs. 

e There are no dividends or stock splits. 

e Short selling is allowed — it is possible to hold a negative amount of the asset. 

e There is a single, constant, risk-free interest rate that applies to any amount of money 
borrowed from or deposited in a bank. 


5.5 Notes and references 


The efficient market hypothesis is at best an approximation to reality. A classic text 
that espouses the hypothesis is (Malkiel, 1990). A more recent book that analyses 
vast amounts of stock market data and casts severe doubt on the efficient market 
hypothesis is (Lo and MacKinlay, 1999). It is important to keep in mind, however, 
that it is a big leap to go from 


(a) claiming that the current asset price movement is somehow correlated with historical 
asset price data, to 

(b) developing a method that can make these correlations sufficiently explicit to be of use 
for prediction. 


Bass (Bass, 1999) describes what seems to be one of the few successful, systematic 
attempts in this direction. The topic is mentioned further in Section 7.4. 

The data used in Figures 5.1-5.3 was downloaded from the Yahoo! Fi- 
nance website at http://finance.yahoo.com/ and processed using MATLAB code 
based on the tools developed by Petter Wiberg at www.maths.warwick.ac.uk/ 
wiberg/MathFinance/. 

It is worth emphasizing that the tests in Section 5.3 were designed solely for the 
purpose of illustration. There are many practical issues to address before a serious 
statistical analysis of stock market data can be performed. Most notably: 


e There may be missing data if no trading took place between times t; and t;+1. 

e For many data sets, each price may correspond to either a buy or a sell — there is an 
in-built noise level at the order of the bid—ask spread. 

e The data may require adjustments to account for dividends and stock splits. 

e When determining the time interval, t;+1 — t;, between price data, a decision must be 
made about whether to keep the clock running when the stock market has closed. Does 
Friday night to Monday morning count as 25 days, or zero days? 

e For an asset that is not heavily traded, the time of the last trade may vary considerably 
from day to day. Consequently, daily closing prices, which pertain to the final trade for 
each day, may not relate to equally spaced samples in time. 
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The book (Lo and MacKinlay, 1999) is a good source of practical information for 
stock market data analysis. 

Many exchanges have informative websites, including the American Stock 
Exchange: www.amex.com/, the Chicago Board Options Exchange: www. 
cboe.com/Home/, the London Stock Exchange: www.londonstockexchange. 
com/, the New York Stock Exchange: www.nyse.com/. 


EXERCISES 


5.1. xx Consider the following quote from Eugene Fama, who was Myron 
Scholes’ thesis adviser, which can be found in (Lowenstein, 2001, page 71). 


If the population of price changes is strictly normal, on the average for any stock 
...an observation more than five standard deviations from the mean should be ob- 
served about once every 7000 years. In fact such observations seem to occur about 
once every three to four years. 


Given that for X ~ N(u, 07), P(X — u| > 50) = 5.733 x 1077, deduce 
how many observations per year Fama is implicitly assuming to be made. 
5.2. Complete the following stock market report in an apt and amusing manner. 


e Knives fell sharply. 
e Guacamole dipped. 
e Toilet tissue bottomed out .... 


5.6 Program of Chapter 5 and walkthrough 


The program ch05 shows one way to compute a quantile—quantile plot, as seen in Figures 4.4, 4.6 and 
5.3. It is listed in Figure 5.4. We use MATLAB’s N(0, 1) pseudo-random number generator, randn. 
The line samples = randn(M,1), assigns M such samples to the array samples. We then use 
ssort = sort (sample), to create an array ssort containing the elements of samples, rearranged 
into ascending order. The line pvals = [1:M]/(M+1), then sets up equally spaced points 1/(M + 
1),2/(M + 1),3/(M@ + 1),..., M/(M + 1) and zvals = sqrt(2)*erfinv(2*pvals-1); com- 
putes the required quantiles, as described in Exercise 4.3. We then plot the ordered samples against 
the quantiles and superimpose a reference line of slope one. 


PROGRAMMING EXERCISES 


P5.1. Use the cumulative sum function cumsum and the bar graph function bar to 
produce a cumulative density plot from chO5.m, as in the lower middle picture of 
Figure 5.3. 


P5.2. Use the code at www.maths.warwick.ac.uk/wiberg/MathFinance/ to 
manipulate and display real stock market data. 
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%CHO05 Program for Chapter 5 
% 
% Illustrates quantile plot 


clf 
randn(‘state’,100) 


M = 200; 


samples = randn(M,1); 
ssort = sort(samples); 


pvals = [1:M]/(M+1); 
zvals = sqrt(2)*erfinv(2* pvals-1); 


plot(ssort,zvals,’rx’) 

hold on 

xlim = max(abs(zvals))+1; 

plot([-xlim, xlim],[-xlim,xlim],’g—’) % Reference of slope 1 
title(’N(0,1) quantile-quantile plot’) 

grid on 


Fig. 5.4. Program of Chapter 5: ch05.m. 


Quotes 


A battle rages between those who say the financial markets 
are theoretically impossible to beat and those who say, 
‘Hey, look at me, I’m a billionaire.’ 
On one side are the Nobel laureates, 
ensconced in the University of Chicago Business School, 
who are renowned for developing equations describing ‘efficient’, 
that is, unbeatable, markets. 
On the other side are the speculators who beat them year in, year out 
with techniques ‘proven’ not to work. 
THOMAS A. BASS (Bass, 1999) 


Who’d have imagined that our largest single equity underwriting 
would coincide with the largest drop in history in the stock market? 
Then, who’d have imagined that our first big junk bond deal 
would coincide with the crash of the junk bond market? 
It was striking how little control we had of events, 
particularly in view of how assiduously 
we cultivated the appearance of being in charge 
by smoking big cigars and saying **** all the time. 
MICHAEL LEWIS (Lewis, 1989) 


An incident of ‘fat finger syndrome’ — inadvertently pressing the wrong button 
on a computer keyboard — landed an American investment bank 
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with multimillion pound losses yesterday 
and is expected to cost the young city trader involved his job. 
... The deal amounted to £300m rather than £3m 
and flashed across stock market screens just as the stock market was about to close, 
causing a precipitous fall in the Footsie, the barometer of British corporate health. 
Slip of the finger that cost city dearly, the Guardian, 16 May 2001 


The traditional view in economics 
is that financial agents are completely rational with perfect foresight. 
Markets are always in equilibrium, 
which in economics means that trading always occurs 
at a price that conforms to everyone’s expectations of the future. 
Markets are efficient, which means that there are no patterns in prices 
that can be forecast based on a given information set. 
The only possible changes in price are random, 
driven by unforecastable external information. 
Profits occur only by chance. 
In recent years this view is eroding. 
J. DOYNE FARMER (Farmer, 1999) 
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Asset price model: Part I 


OUTLINE 


e discrete asset model 
e continuous asset model 


lognormal distribution 
e confidence intervals 


6.1 Motivation 


Our aim in this chapter is to motivate and derive the classic model for asset price 
behaviour. We do this in a heuristic manner, making clear the assumptions that are 
being made and keeping in mind that the model will be used as the basis for an 
option valuation theory. 

Given the asset price So at time t = 0, our objective is to come up with a process 
that describes the asset price S(t) for all times 0 < t < T. Due to the unpredictable 
nature of assset price movements, S(t) will be a random variable for each t. Al- 
though asset prices are typically rounded to one or two decimal places, we assume 
here that an asset may have any price > 0. 

Our approach is to set up an expression for the relative change over an inter- 
val of time ôt and then let ôt —> 0 in order to get an expression that is valid for 
continuous ¢. 


6.2 Discrete asset model 


As a Starting point for our model we note from Exercise 2.2 that the change in the 
value of a risk-free investment over a small time interval êt can be modelled as 


D(t + dt) = Dit) + ré6tD(o), (6.1) 


where r is the interest rate. In order to account for the typical, unpredictable 
changes in asset price, we will add a random element to this equation. We saw 
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in Chapter 5 that the efficient market hypothesis says that the current asset price 
reflects all the information known to investors, and hence any change in the price 
is due to new information. We may build this into our model by adding a ran- 
dom ‘fluctuation’ increment to the interest rate equation and making these incre- 
ments independent for different subintervals. To make this precise, let t; = i ôt, 
so that asset prices are to be determined at discrete points {t;}. (We will then 
let ôt — 0 to get an asset price model over 0 < t < T.) Our discrete-time model 
is 


S(tiz1) = S(ti) + USt S (ti) + oV8t Yi S (ti), (6.2) 
where 


e u is a constant parameter. (Typically u > 0, so that uôtS(t;) represents a general up- 
ward drift of the asset price. The parameter u plays the same role as the interest rate r 
in (6.1).) 

e o > Ois a constant parameter that determines the strength of the random fluctuations. 

e Yo, Y1, Yo,... are iid. N(O, 1). 


It is worth emphasizing a few points. 


(i) Since a N(O, 1) random variable is symmetric about the origin, the fluctuation factor 
oV/5tY; is equally likely to be positive or negative, and the probability that it lies in 
an interval [a, b] is the same as the probability that it lies in the interval [—b, —a]. 

(ii) The presence of the factor ôt (rather than some other power of ôt) turns out to be 
necessary in order for a sensible continuous-time limit to exist. Exercise 6.1 follows 
this through. 

(iii) The choice of a normal distribution for Y; is not arbitrary — because of the Central 
Limit Theorem, we would arrive at the same continuous-time model for S(t) if we 
just assumed that {Y;};>0 were i.i.d. with zero mean and unit variance. Exercise 6.2 
asks you to confirm this. 


The parameter u in (6.2) is usually called the drift and o is called the volatility. 
The model is statistically the same if ø is replaced by —o, see Exercise 6.3. Con- 
vention dictates that o is taken to be > 0. Typical values for o lie between 0.05 
and 0.5, that is, 5% and 50% volatility. Because we are measuring time in years, 
the units of ø? are per annum. The drift parameter is typically between 0.01 and 
0.1, but, as we will see in Chapter 8, its value turns out to be irrelevant in valuing 
an option. 

We point out that in the model (6.2), the returns (S(¢;41) — S (ti))/S (ti) form 
a normal i.i.d. sequence, in line with the broad conclusions that we drew in Sec- 
tion 5.3 after examining real data. 
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6.3 Continuous asset model 


Suppose we consider the time interval [0, t] with t = Ldt. We know S(O) = So and 
the discrete model (6.2) gives us expressions for S(t), S(26t),..., S(Ldt = t). 
The plan is to let ôt — 0, and hence let L — oo, to get a limiting expression for 
S(t). 

The discrete model (6.2) says that over each df time interval the asset price gets 
multiplied by a factor 1 + pdt + o /5tY;, and hence 


L-1 
Sa) = So J] (1 + pdt + oV5t¥; ) l 
i=0 
Dividing through by So and taking logs gives 
log (=) = X log(1 + uôt + o v'ôtY;i). (6.3) 
0 i=0 


We are interested in the limit ôt — 0, so we would like to exploit the approxima- 
tion log(1 + €) X € — e?/2 + ---, for small e. There is a technical issue that we 
will gloss over. The quantity Y; in (6.2) is a random variable, not just a real num- 
ber, but it can be shown that what we are about to do is justifiable because E(Y > 
is finite. Continuing in the belief that the log expansion remains valid, we obtain 


S(t L-1 
log (Œ) N X (ust + oV8tY; — 5076tY?), (6.4) 
So i=0 


where we have ignored terms that involve the power 5t°/ or higher. Exercise 6.4 
asks you to show that 


E (not + oV51Y; — 50761Y?) = nôt — 30751 (6.5) 
and 
var (use +o[vôtY; — 3075t¥?) = o?ôt + higher powers of dt. (6.6) 


Now, insight from the Central Limit Theorem suggests that log(S(t)/So) in (6.4) 
will behave like a normal random variable with mean L (uôt — 507 6t) =(u— loD) 
and variance Lo*ét = o°t, that is, approximately, 


log (Œ) ~ N((u — $0°)t, o°t). (6.7) 
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Based on these arguments, our limiting continuous-time expression for the asset 
price at time t becomes 


S(t) = Spe 2° '49ViZ where Z ~ N(O, 1). (6.8) 


In this derivation there was nothing special about starting at time zero — we can 
equally well argue that the asset price evolves from time t = tı to t = t2, where 
t2 > tı, according to 


S =) 12 2 

&(—— ] ~N((u -= 30°) (2 — t1), 0° (2 — t1)). 
€ (ti) ( i ) 

A key point is that across non-overlapping time intervals, the normal random vari- 


ables that describe these changes will be independent. This follows because the 
Y; in (6.2) are i.i.d. Hence, for t3 > t2 > tı we have 


S 
g ($) ~ N((u — 307)(6 — n), 0%(t3 — t2)), 


S(t 
and is independent of log (=) : 
1 


So we can describe the evolution of the asset over any sequence of time points 


O=1 <t, <th <t; <- <ty by 


S(tia1) = S(t eer 27 ito Viti foriid. Z;~N(O,1). (6.9) 


6.4 Lognormal distribution 


A random variable S(t) of the form (6.8) has a so-called lognormal distribution; 
that is, its log is normally distributed. Note from (6.8) that since Sọ > 0, S(t) is 
guaranteed to be positive at any time; we have P(S(t) > 0) = 1, for any t > 0. So 
S(t) takes values in (0, œo). The corresponding density function for S(t) is 


Si (=s = 07/2") 


o2 
f@= eect . forx >0, (6.10) 
xon/20t 


with f(x) = 0, for x < 0, see Exercise 6.5. 
The expected value, second moment, and variance of S(t) with this model turn 
out to be 


ECS) = Soe”, (6.11) 
(S(t)2) = SReCHto N, (6.12) 
var(S(t)) = Sĉe? (e7 — 1), (6.13) 


see Exercise 6.6. 
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t=1 


0 0.5 1 1.5 2 2.5 3 3.5 4 


Fig. 6.1. Lognormal density (6.10) for u = 0.05, So = 1, with o = 0.3 (solid) 
and ø = 0.5 (dashed). Upper picture t = 1. Lower picture t = 3. 


Computational example In Figure 6.1 we set Sg = 1 and u = 0.05, and plot 
the lognormal density function (6.10) for o = 0.3 and ø = 0.5. The upper pic- 
ture is for t = 1 and the lower picture for t = 3. Note that the density is skewed — 
it has no vertical axis of symmetry. We know from (6.13) that the variance of S(t) 
grows with ¢, and this is clear from the figure — the density function spreads out 
when ¢ increases. The mean of S(t) also grows with ¢, from (6.11), although this 
is less obvious in the figure. © 


We are deliberately avoiding the direct use of stochastic calculus in this book. 
However, it is worth mentioning that the process S(t) defined by (6.9) can be 
regarded as the solution of a stochastic differential equation (SDE). In this context, 
S(t) is often referred to as geometric Brownian motion. Section 6.6 gives some 
routes into this fascinating topic. 


6.5 Features of the asset model 


We can get some feeling for a continuous random variable by examining its confi- 
dence intervals. Suppose that 


P (a < X <b)=0.95. 
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Then we say that [a, b] is a 95% confidence interval for X. In the case where X 
is normal, there is no simple formula for the inverse of the distribution function 
N (x) in (3.18), and hence confidence intervals must be computed numerically. It 
is found that for X ~ N(O, 1), 


P (|X| < 1.96) = 0.95, (6.14) 


see Exercise 6.7, so [—1.96, 1.96] is a 95% confidence interval for X. More gen- 
erally, for X ~ N(u, 07), we have (Y — u)/o ~ N(O, 1), so 

P (u — 1.960 < Y < u + 1.960) = 0.95, (6.15) 
and hence [u — 1.960, u + 1.960] is a 95% confidence interval. This result is 
often expressed along the lines of 


for i.i.d. normal samples, 95 times out of 100 the sample lies within two standard deviations 
of the mean. 


It follows from (6.7) that 
[Sge 1260 ViH (307) Spel 260 vitu- 30°) (6.16) 
is a 95% confidence interval for the asset price S(t), see Exercise 6.9. If t is small, 
then 
e71 960 Vi+(u- 507) re e71 960/1 x 1 — 1.960/1 


and 


el aVit(U-307)t o plI60VT o 1 1.960. 


So the confidence interval is approximately 
[So(1 — 1.960 Vt), So(1 + 1.960-V7)]. 


The width of this interval is 2501.960 ~t. If we regard the confidence interval 
width as a measure of the uncertainty in the future asset price, then this result 
explains the traders’ rule-of-thumb that 


over small time periods, uncertainty grows like the square root of time. 


Although option valuation is concerned only with the asset price over a fixed 
time horizon, [0, T], it is interesting to see what the model (6.8) predicts about 
long term behaviour. Since u and o are positive, we see from (6.12) that 


lim E(S(t)?) = o, as t— o. 
too 


In words, we say that the asset tends to infinity in mean square as time increases. 
On the other hand, it can be shown that the (u — 50°)t term dominates the o /tZ 
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term in (6.8), so that, with probability 1, 


œo, if w— z07>0, 


1 


(6.17) 
0, if u- 407 <0. 


lim S(t) = | 
t>o 
So, according to the model, if the volatility is sufficiently large (o? > 2) then, 
with probability 1, the asset price will eventually decay to zero. 


6.6 Notes and references 


The asset price model that we developed is extremely widely used in mathematical 
finance. The discrete version (6.2) can be regarded as a numerical approximation 
to the SDE formulation. The text (Kloeden and Platen, 1992) is the classic in this 
area. The expository articles (Higham, 2001; Higham and Kloeden, 2002) give 
lower level entry points. 

The continuous model characterized by (6.8) and (6.9) is the solution to an SDE. 
Reasonably accessible SDE texts are (Gard, 1988; Mao, 1997; Øksendal, 1998), 
although all require some background in stochastic processes — the text (Brzeźniak 
and Zastawniak, 1999) is a good place for beginners to start. 

The result (6.17) can be established through the Strong Law of Large Numbers, 
and the u = lo? case can be dealt with by the Law of the Iterated Logarithm; these 
laws are discussed, for example, in (Grimmett and Stirzaker, 2001; Kloeden and 
Platen, 1999). 

Although widely used, the lognormal asset price model is, of course, extremely 
simplistic and open to criticism. Section 7.4 gives pointers to some of the work 
that has been done on alternative models. 


EXERCISES 


6.1. «x Consider the following variations on the discrete model: 
S(ti+1) = S (ti) + uôtS (ti) + o ôtY;S (ti), 
and 
SCti41) = S(t) + StS C) + o ôt EYS C). 


By mimicking the heuristic derivation that led to the continuous model 
(6.8), show that neither of these two variations is satisfactory. 

6.2. xx» Consider the discrete model (6.2) in the case where {Y;} are general 
i.i.d. random variables with zero mean and unit variance (i.e., not neces- 
sarily normal). Assume also that E(Y >) and Y$) are finite. Mimic the 
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6.3. 


6.4. 
6.5. 


6.6. 
6.7. 


6.8. 


6.9. 


6.10. 
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heuristic derivation that led to (6.8) and show that the same continuous 
model arises. 

xx Explain why the model (6.2) is “statistically the same if o is replaced by 
—o’. 

xx Verify (6.5) and (6.6). [Hint: use Exercise 3.7.] 

xxx Show that S(t) in (6.8) has density function (6.10). [Hint: use the 
characterization P(a < S(t) < b) = f. i f (x) dx from (3.3).] 

x x x Using (3.8), show that (6.11), (6.12) and (6.13) follow from (6.10). 
xx Let a be the number such that 


P(|Z| <a) =0.95, where Z ~ N(0, 1). 


Recalling that N (-) denotes the N(0, 1) distribution function, show that a 
satisfies 


0.05 


After referring to Exercise 13.3, show that œ satisfies a = /2 erfinv 
(0.95). Typing this into MATLAB gives a = 1.9600 (to 4 decimal 
places). 

x Given that P (|Z| < 2.58) = 0.99, for Z ~ N(O, 1), show how (6.14) 
changes when a 99% confidence interval is required. 

x Show from (6.7) that (6.16) gives a 95% confidence interval for the asset 
price S(t). 

x Using Exercise 6.8, derive a 99% confidence interval for the asset price 
S(t). Does the traders’ rule-of-thumb still apply? 


6.7 Program of Chapter 6 and walkthrough 


In ch06, listed in Figure 6.2, we plot the lognormal density function for two different ø values. 


The resulting picture is similar to those in Figure 6.1. The array y1 stores the value of the density 


function at equally spaced points in x for the first set of parameter values: t = 1,S = 1,mu = 0.05 


and sigma = 0.3. We plot the curve as a red dashed line. The computation is then repeated with 


sigma = 0.5 anda blue dotted curve is drawn. 


PROGRAMMING EXERCISES 


P6.1. Adapt ch06 to give a waterfal1 plot illustrating how the lognormal density 
function varies with o for fixed t = 1. 


P6.2. Repeat programming exercise P6.1 for t varying and o fixed at 1. 
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%CHO6 Program for Chapter 6 
% 
% Plots lognormal density function. 


clf 


x = linspace(.01,4,500); 
t=1; S = 1; mu = 0.05; 


sigma = 0.3; 

tempa = ((log(x/S) - (mu-0.5*sigma’2)*t).°2)/(2*t*sigma‘2); 
tempb = x*sigma*sqrt(2* pi*t); 

y1 = exp(-tempa)./tempb; 

plot(x,y1,’r-’) 

ylim([0 1.5]) 

hold on 


sigma = 0.5; 

tempa = ((log(x/S) - (mu-0.5*sigma’2)*t).°2)/(2*t*sigma‘2); 
tempb = x*sigma*sqrt(2* pi*t); 

y2 = exp(-tempa)./tempb; 

plot(x,y2,’b:’) 


legend(’\sigma = 0.3’,’\sigma = 0.5’,1) 
title(’Lognormal density, t = 1, S=1, \mu = 0.05’) 
xlabel(’x’), ylabel(’f(x)’) 


Fig. 6.2. Program of Chapter 6: ch06.m. 


Quotes 


The authors emphasize that, 
as even the most cursory examination of the historical record reveals, 
“geometric Brownian motion’ is at best a first approximation 
to the actual movements of the price of any real stock or collection of stocks. 
Even their assumption that the governing processes are stochastic — rather than 
examples of deterministic chaos — may in time be disproved 
by sufficiently sensitive measurement techniques. 
JAMES CASE, reviewing (Mantegna and Stanley, 2000) 


The Brownian motion model is extremely popular, 

not primarily because of statistical evidence, 

but because it is only with this model that we can determine option prices exactly. 
ROBERT F. ALMGREN (Almgren, 2002) 


As a graduate student at the London School of Economics 
I was taught that stock markets were efficient. 
Broadly this means that all outstanding information about companies 
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is built into their share prices, i.e. they are always fairly valued. 
This sad fact was hammered home to students with a series of studies 
demonstrating that stock-market brokers and analysts, 
people with the very best information, 
fared no better in their stock-market selections than a monkey 
drawing a name from a hat, 
or a man throwing darts at the pages of the Wall Street Journal. 
The first implication of the so-called efficient markets theory is that 
there is no sure way to make money in the stock market 
other than trading on inside information. 
Milken, and others on Wall Street, saw that this simply was not true. 
The market, which may have been quick to digest earnings data, 
was grossly inefficient in valuing everything from the land a company owns to the pension 
fund it creates. 
MICHAEL LEWIS (Lewis, 1989) 


A trade takes place when the greediest buyer, 
afraid that prices will run away from him, 
steps up and bids a penny more. 
Or the most fearful seller, afraid of getting stuck with his merchandise, 
agrees to accept a penny less. 
ALEXANDER ELDER (Elder, 2002) 
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OUTLINE 


e computing discrete asset paths 
e timescale invariance 
e sum-of-square returns 


7.1 Computing asset paths 


Having derived the model, we may use (6.9) to generate computer simulations of 
asset prices. Suppose we wish to simulate the evolution of S(t) at certain points 
(ane with 0 = tọ < ti < t2 <--- < tg =T. We may compute values SASi 
according to 


Sig = Sje H720 iito Vit i, (7.1) 


where each &; is a sample from a N(0, 1) pseudo-random number generator. The 
resulting points (¢;, S;) form a discrete asset path. 


Computational example Figure 7.1 shows the results of such a simulation 
with 10° time points equally spaced in [0,3]. We took So = 1, u = 0.05 and 
o = 0.1. To produce the picture, we followed the usual convention of joining 
the discrete data points (t;, S;) by straight lines. Overall, the resulting picture 
agrees qualitatively with typical asset price plots, such as those in Figures 5.1 
and 5.2. Ko 


To obtain the picture in Figure 7.1 we computed a discrete, but closely spaced, 
set of data points and joined them with straight lines. The picture seems to suggest 
that the points lie on a continuous, but ‘jagged’, curve. This concept can be for- 
malized. On the one hand it can be shown that, with probability 1, an asset path 
arising from the ôt — O limit in (6.2) will be a continuous function of t. But on 
the other hand it can also be shown that, with probability 1, the path will not have 
a well-defined tangent at any point. 
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Discrete asset path 
1.5 T T T 


Fig. 7.1. Discrete asset path of the form (7.1). Discrete points are joined by 
straight lines to give the impression of a continuous curve. 


We would also expect from the original discrete model (6.2) that increasing the 
volatility parameter o should turn up the ‘jaggedness’. The next computational 


example tests for this effect. 


Computational example Figure 7.2 shows asset paths computed with the same 
parameters as for Figure 7.1, except that we set o = 0.2 in the upper picture 
and o = 0.4 in the lower picture. The same psuedo-random number sequence 
{&} was used in both cases. The results confirm that the volatility parameter o 
controls the jaggedness of the path. © 


Although individual asset paths are nonsmooth functions, we know from (6.11) 
that the mean of S(t) is smooth. This is confirmed in the next computational ex- 


ample. 


Computational example Here we take u = 0.2 and ø = 0.3 and use 10° 
equally spaced time points over [0,3]. We generated 10* such discrete paths, 
starting from So = 1 but using different random number generator samples for 
each path. The upper picture in Figure 7.3 shows the first 20 such paths. In the 
lower picture we plot the sample mean: at each time point we plot the average 
of the 10* different asset values. We see that this sample mean is indeed smooth; 


7.1 Computing asset paths 


Fig. 7.2. Two discrete asset paths of the form (7.1). Lower picture has higher 
volatility. 


Asset 


Mean 


0 0.5 1 1.5 2 2.5 3 


Fig. 7.3. Upper picture: 20 discrete asset paths. Lower picture: sample mean 
of 10* discrete asset paths. 
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0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 


Fig. 7.4. Upper picture: 50 discrete asset paths over [0, T] with So=1, u = 
0.05, o = 0.5, T = 1 and ôt = 10~*. Lower picture: histogram for S(T) from 
104 such paths, with lognormal density function (6.10) superimposed. 


it is visually indistinguishable from the exact mean Soe”! that we derived 


in (6.11). © 
We next give a test that confirms the lognormal behaviour of the asset model. 


Computational example Here, we set So = 1, u = 0.05 and o = 0.5, and com- 
puted discrete paths over [0, T], with T = 1. We used a uniform time spacing of 
ti41 — ti = ôt = 107?. The upper picture in Figure 7.4 shows 50 such paths. In 
the lower picture we give a kernel density estimate for the asset price at expiry. 
This was computed in the manner discussed in Section 4.3, using a histogram 
with 45 bins of width 0.05. The corresponding lognormal density function (6.10), 
which is superimposed as a dashed line, gives a good match. © 


7.2 Timescale invariance 


The next computational example reveals a key property of the asset price model. 
The jaggedness looks the same over a range of different timescales. In other 
words, zooming in or out of the picture, we see the same qualitative behaviour. 
We saw the same effect when we moved from daily to weekly data in Figures 5.1 
and 5.2. 
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Asset path zoom 
1.5 T 


0 0.02 0.04 0.06 0.08 0.1 


0.95 i i i i 
0.002 0.004 0.006 0.008 0.01 


Fig. 7.5. The same asset path sampled at different scales. Upper picture: 100 
samples over [0, 1]. Middle picture: 100 samples over [0, 0.1]. Lower picture: 
100 samples over [0, 0.01]. 


Computational example To generate Figure 7.5, we computed a single asset 
path for So = 1, u = 0.05 and o = 0.5 at equally spaced time points in [0, 1] a 
distance 1074 apart. Using this data, we plot three pictures. Each picture shows 
the path at 100 equally spaced time points. 


e The upper plot shows the path at 100 equally spaced points in [0, 1]. 
e The middle plot shows the path at 100 equally spaced points in [0, 0.1]. 
e The lower plot shows the path at 100 equally spaced points in [0, 0.01] 


We see that zooming in on the path in this manner does not reveal any change in 
the qualitative features — the path is ‘jagged’ at all time scales. © 


To understand why the pictures have this ‘timescale stability’ we go back to the 
discrete model (6.2) and consider 


e a small time interval ôt, 
e very small time interval ôt = 5t/L, where L is a large integer. (In Figure 7.5 we used 
quite a moderate value, L = 10.) 


Using (6.2) to get from time t = 0 to t = St we have 


S(St) — So = So(uadt + oV5t¥o) = SoN(udt, 0257) (7.2) 
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for the change in S(t). From time t = 0 to t = ôt, increments like this add up: 


L-1 L-1 
SGH) — So = Y (SE + DE) — SGS) = Y S451) (USE + oVEY;). 
i=0 i=0 


Approximating! each S (iSt) by So and using insight from the Central Limit The- 
orem suggests that 


L-1 
S (St) — So ~ So X (use 4 ov 51¥; ) = SoN(uL St, o2L5t) = SoN(uôt, 0261), 
i=0 


which reproduces (7.2) over the longer timescale. 


7.3 Sum-of-square returns 


In Section 5.3 we introduced the concept of the return of an asset; this is simply 
the relative price change. For small ôt = t;41 — ti our original discrete model (6.2) 
assumes that 

S(ti41) — S (ti) 


= uôt ôtY;, 7.3 
E ust + ov ôt (7.3) 


so the return is an N(uôt, 075) random variable. Under this model we know the 
Statistics of the return — given any numbers a and b we can work out the probability 
that the return over the next interval lies between a and b, but, of course, we cannot 
predict with any certainty what actual return will be seen. 

By contrast with the uncertainty of returns, we can show that the sum-of-square 
returns is predictable. Suppose the interval [0, t] is divided into a large number of 


equally spaced subintervals [0, t1], [t1, fo], ..., [fr-1, tt], with t; = iôt and ôt = 
t/L. Then from (7.3) it is straightforward to show that 
Sti) — S(H)\? 
E (a = o7St + higher powers of ôt, (7.4) 
S(t) 
and 


2 
var (23) l = 204817 + higher powers of ôt, (7.5) 
S (ti) 
see Exercise 7.1. 
Hence, using insight from the Central Limit Theorem, i ((S(ti+1)— 
S(t:))/S(t;))? should behave like N(Lo?ôt, L20*5t7), that is, N(o?t, 204t6t). 
This random variable has a variance proportional to ôt, and hence is essentially 


1 Some justification for this type of approximation can be found in Section 8.2. 
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Fig. 7.6. Upper pictures: asset paths. Lower pictures: running sum-of-square 
returns (7.6). 


constant. Thus, although the individual returns are unpredictable, the sum of the 
squared returns taken over a large number of small intervals is approximately equal 


to o°t. 


Computational example Figure 7.6 confirms the sum-of-square returns result. 
We use So = 1, u = 0.05 and o = 0.3. Ten asset paths over [0, 0.5] are shown 
in the upper left plot. The paths were computed using equally spaced time points 
a distance ôt = 0.5/100 = 5 x 1073 apart, so L = 100. The lower left picture 
plots the running sum-of-square returns 


k : _ SAS 
> (so sw) ae 
L St) 


against ft for each path. The sum is seen to approximate o*f; the height 
o7/2 is shown as a dotted line. The right-hand pictures repeat the experiment 
with L = 10°, so ôt =5 x 1074. We see that reducing ôt has improved the 


match. © 


7.4 Notes and references 


Our treatment of timescale invariance in Section 7.2 can be made rigorous, but the 
concepts required are beyond the scope of this book. (The essence is that if W (t) is 


70 Asset price model: Part IT 


a Brownian motion then so is W(c?t) /c, for any constant c > 0; see, for example, 
(Brzezniak and Zastawniak, 1999, Exercise 6.28) and (Brzezniak and Zastawniak, 
1999, Exercise 7.20), and their solutions, for details of this result and why it applies 
to the asset model.) 

There have been numerous attempts to develop generalizations or alternatives to 
the lognormal asset price model. Many of these are motivated by the observation 
that real market data has fat tails — extreme events occur more frequently than a 
model based on normal random variables would predict. 

One approach is to allow the volatility to be stochastic, see (Duffie, 2001; Hull, 
2000; Hull and White, 1987), for example. Another is to allow the asset to undergo 
‘jumps’, see (Duffie, 2001; Hull, 2000; Kwok, 1998), for example. Jump models 
are especially popular for modelling assets from the utility industries, such as elec- 
trical power. The article (Cyganowski et al., 2002) discusses some implementation 
issues. 

An alternative is to take a general, parametrized class of random variables and 
fit the parameters to stock market data, see (Rogers and Zane, 1999), for example. 

A completely different approach is to abandon any attempt to understand the 
processes that drive asset prices (in particular to pay no heed to the efficient mar- 
ket hypothesis) and instead to test as many models as possible on real market data, 
and use whatever works best as a predictive tool. A group of mathematical physi- 
cists with expertise in chaos and nonlinear time series, led by Doyne Farmer and 
Norman Packard, took up this idea. They founded The Prediction Company in 
Santa Fe. The company has a website at www.predict.com/html/ introduction.html 
which makes the claim that 


Our technology allows us to build fully automated trading systems which can handle huge 
amounts of data, react and make decisions based on that data and execute transactions 
based on those decisions — all in real time. Our science allows us to build accurate and 
consistent predictive models of markets and the behavior of financial instruments traded in 
those markets. 


The book (Bass, 1999) gives the story behind the foundation and early years of the 
company and has many insights into the practical issues involved in collecting and 
analysing vast amounts of financial data. 


EXERCISES 


7.1. xxx Confirm the results (7.4) and (7.5). 

7.2. xx By analogy with the continuously compounded interest rate model, we 
may define the continuously compounded rate of return for an asset over 
[0, t] to be the random variable R satisfying S(t) = Soe". Using (6.8), show 
that R ~ N(u — 07/2, 07/t). 
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7.5 Program of Chapter 7 and walkthrough 


The program ch07, listed in Figure 7.7, produces a plot of 50 asset paths in the style of the upper pic- 
ture in Figure 7.4. Having initialized the parameters, we make use of the cumulative product function, 
cumprod, to produce an array of asset paths. Generally, given an M by L array X, cumprod(X) cre- 
ates an M by L array whose (i, j) element is the product X(1,j)*X(2, j)*X(3,j)*...*X(i,j). 
Supplying a second argument set to 2 causes the cumulative product to be taken along the sec- 
ond index — across rows rather than down columns, so cumprod(X,2) creates an M by L array 
whose (i, j) element is the product X(i,1)*X(i,2)*X(i,3)*...*X(i,j). We also supply two 
arguments to the randn function: randn(M,L) produces an M by L array with elements from the 
randn pseudo-random number generator. 
It follows that 


Svals = S*cumprod(exp((mu-0.5+*sigma*2)*dt + sigma*sqrt (dt) *randn(M,L)),2); 


creates an M by L array whose ith row represents a single discrete asset path, as in (6.9). The next 
line 


Svals = [S*ones(M,1) Svals]; % add initial asset price 


adds the initial asset as a first column, so that the ith row Svals(i,1),Svals(i,2),..., 
Svals(i,L+1) represents the asset path at times 0,dt,2dt,3dt,...,T. 


PROGRAMMING EXERCISES 


P7.1. Write a program that illustrates the timescale invariance of the asset model, 
in the style of Figure 7.5. 


P7.2. Use mean and std to verify the approximations (7.4) and (7.5) for (7.3). 


%CHO7 Program for Chapter 7 
% 
% Plot discrete sample paths 


randn(‘state’,100) 
clf 


YHKWAVWWV%Y% Problem parameters %W%%%%%PA% AAA 
S = 1; mu = 0.05; sigma = 0.5; L = 1e2; T = 1; dt = T/L; M = 50; 
Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo 


tvals = [0:dt:T]; 

Svals = S*cumprod(exp((mu-0.5*sigma^2)*dt + sigma*sqrt(dt)*randn(M,L)),2); 
Svals = [S*ones(M,1) Svals]; % add initial asset price 

plot(tvals,Svals) 

title("50 asset paths’) 

xlabel('t’), ylabel(’S(t)’) 


Fig. 7.7. Program of Chapter 7: chO7 .m. 


72 Asset price model: Part IT 


Quotes 


But as a warning, 
let me note that a trader with a better model might still not be able to transform 
this knowledge into money. 
Finance is consistent in its ability to build good models 
and consistent in its inability to make easy money. 
The purpose of the model is to understand the factors 
that influence and move option prices 
but in the absence of an ability to forecast these factors 
the transformation into money remains non-trivial. 
DILIP B. MADAN (Madan, 2001) 


Evidence countering the efficient market hypothesis 

comes in the form of stock market anomalies. 

These are events that violate the assumption that stock returns 

are randomly distributed. 

They include the size effect 

(big-company stocks out-perform small-company stocks or vice versa); 

the January effect 

(stock returns are abnormally high during the first few days of January); 

the week-of-the-month effect 

(the market goes up at the beginning and down at the end of the month); 

and the hour-of-the-day effect 

(prices drop during the first hour of trading on Monday and rise on other days). 

Prices fall faster than they rise; 

the market suffers from ‘roundaphobia’ 

(the Dow breaking ten thousand is a big deal); 

and the market tends to overreact 

(aggressive buying after good news is followed by nervous selling, 

no matter what the news). 

Finally, the efficient market hypothesis is incapable of explaining 

stock market bubbles and crashes, insider trading, monopolies, 

and all the other messy stuff that happens outside its perfect models. 
THOMAS A. BASS (Bass, 1999) 


Prices reflect intelligent behavior of rational investors and traders, 
but they also reflect screaming mass hysteria. 
ALEXANDER ELDER (Elder, 2002) 


8 
Black—Scholes PDE and formulas 


OUTLINE 


e sum-of-squares for asset price 

e replicating portfolio 

e hedging 

e Black-Scholes PDE 

e Black-Scholes formulas for a European call and put 


8.1 Motivation 


At this stage we have defined what we mean by a European call or put option on 
an underlying asset and we have developed a model for the asset price movement. 
We are ready to address the key question: what is an option worth? More precisely, 


can we systematically determine a fair value of the option at t = 0? 


The answer, of course, is yes, if we agree upon various assumptions. Although 
our basic aim is to value an option at time t = 0 with asset price $(0) = So, we 
will look for a function V (S, t) that gives the option value for any asset price S > 0 
at any time 0 < t < T. Moreover, we assume that the option may be bought and 
sold at this value in the market at any time 0 < t < T. In this setting, V (So, 0) is 
the required time-zero option value. We are going to assume that such a function 
V(S,t) exists and is smooth in both variables, in the sense that derivatives with 
respect to these variables exist. It was mentioned in Section 7.1 that S(t) is not a 
smooth function of t — it is jagged, without a well-defined first derivative. However, 
it is still perfectly possible for the option value V (S, t) to be smooth in S and t. 
Looking ahead, Figures 11.3 and 11.4 illustrate this fundamental disparity. 

Our analysis will lead us to the celebrated Black-Scholes partial differential 
equation (PDE) for the function V. The approach is quite general and the PDE 
is valid in particular for the cases where V (S,t) corresponds to the value of a 
European call or put. 
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The key idea in this chapter is hedging to eliminate risk. To reinforce the idea, 
and emphasize that it is a concrete tool as well as a theoretical device, the next 
chapter is devoted to computational experiments that illustrate hedging in practice. 

Before launching into a description of hedging, we first introduce one of the 
main ingredients that goes into the analysis. 


8.2 Sum-of-square increments for asset price 


To make progress, we need to work on two timescales. For the rest of the chapter 
we use 


e a small timescale, determined by a time increment Ar, and 
e a very small timescale, determined by a time increment ôt = At/L, where L is a large 
integer. 


We consider some general time ¢ € [0, T ] and general asset price S(t) > 0, and fo- 
cus on the small time interval [t, t + At]. This is broken down into equally spaced, 


very small, subintervals of length ôt, giving [fo, t1], [t1, t2], ..., [tz-1, tL] with 
to = t, tL = t + At and, generally, t; = t + iôt. 
We will let 


6S; := S(ti+1) — SG) 


denote the change in asset price over a very small time increment. Before attempt- 
ing to derive the Black-Scholes PDE, we need to establish a preliminary result 
about the sum-of-square increments, ay as}. A similar analysis was done in 
Section 7.3 for the sum-of-square returns, sar 9 (OS; /S (t;))?. 

Returning to the discrete model (6.2) we ave 


8S; = S(t) (ut +o V5tY;), 


where the Y; are i.i.d. N(O, 1). So 


Yast = 2o (281? +2uoôt?Y; +a76tY?). (8.1) 
i=0 


We now make this summation amenable to the Central Limit Theorem by replacing 
each S(t;) by S(t). This approximation, which is discussed further in the next 
paragraph, gives us 


L-1 L-1 
> 5S? = S(t)? X ae? +2uoôt?Y; + o°5tY?). (8.2) 
i=0 i=0 
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Working out the mean and variance of the random variables inside the summation 
and appealing to the Central Limit Theorem suggests the approximate relation 


L-1 

XO 887 ~ SPN? Lât, 204L8t) = S(t)N(o At, 204 Atôt), (8.3) 

i=0 
see Exercise 8.1. Because ôt is very small, the variance of that final expression is 
tiny, leading us to conclude that the sum-of-square increments is approximately a 
constant multiple of S (t)?: 


L-1 
XOS? ~ S(t)o7 At. (8.4) 
i=0 
The step of replacing each S(t;) in (8.1) by S(t) can be loosely justified as 
follows. Our model (6.9) shows that 


S(t) = Sje 3? dtto VIBTZ for some Z ~ N(O, 1). 
Using e* ~ 1 + x for small x, we have 
St) © SHOA + oVidstZ) 
and since iôt < Lôt = At, we may write, loosely, 
St) — S(t) = OWAN). 


In words, approximating each S(¢;) by S(t) introduces an error that is roughly 
proportional to /Ar. We may thus argue that replacing each S(¢;) in (8.1) with 
S(t) will not affect the leading term in the approximation (8.4). This is far from a 
rigorous argument — Z is a random variable, not simply a real number — but it can 
be shown that the overall conclusion is valid. 


Computational example Although we are not in a position to prove (8.4) rigo- 
rously, we can certainly illustrate the result via a computational experiment. We 
may copy the way that Figure 7.6 was produced, but now computing the sum- 
of-square increments, instead of the sum-of-square returns. We set So = 1, u = 
0.05 and o = 0.3. The upper left plot in Figure 8.1 shows ten discrete asset 
paths over [0, At] with At = 0.5, using equally spaced points a distance ôt = 
At/100 = 5 x 107° apart. So L = 100 and t = 0. The lower left picture plots 
the running sum-of-square increments 


k 
>> 68? (8.5) 
i=l 
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Fig. 8.1. Upper pictures: asset paths. Lower pictures: running sum-of-square 
increments (8.5). 


against tg for each path. We see that the sum typically approximates o? At = 
0.045 as k approaches L. The right-hand pictures give the same information for 
an example with Ar = 0.1 and L = 1000, so ôt = 1074. We see that the quality 
of the approximation (8.4) has improved. © 


8.3 Hedging 


Now, to find a fair option value, we set up a replicating portfolio of asset and cash, 
that is, a combination of asset and cash that has precisely the same risk as the 
option at all time. The portfolio will consist of a cash deposit D and a number A 
of units of asset. We allow D and A to be functions of asset price S and time t. 
The portfolio value, denoted by TI, thus satisfies 


TI(S, t) = A(S,t)S + D(S,f). (8.6) 


We must specify how the asset holding A(S,¢) and cash deposit D (S, t) are 
going to vary with S and t. Before delving into the details it is perhaps useful to 
remind ourselves of some basic assumptions that are being made, all of which have 
been introduced earlier: 


e there are no transaction costs, 
e the asset can be bought/sold in arbitrary units, 
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e short selling is permitted, 

e no dividends are paid, 

e the interest rate r is constant, 

e trading of the asset (and option) can take place in continuous time. 


To avoid unreadably long equations we will also introduce some shorthand nota- 
tion. A subscript i denotes evaluation of a function at (S (t;), ti), so 
V; means V (S(¢;), ti), II; means TI (S(t;), ti), etc. 
No subscript denotes evaluation at (S(t), t), so 
V means V (S(t), t), II means TI (S(t), t), etc. 


The symbol ô denotes the difference over a timestep of length ôt, so 


e ôS; means S(t;4+1) — SG), 

e ôV; means V (S(ti+1), ti+1) — VSG), ti), 
e OTT; means II (S(ti+1), ti+1) — TI(S (ti), ti), 
e ô(V — II); means ôV; — ôIl;, etc. 


Our strategy for the portfolio (8.6) is to keep the amount of asset constant over 
each very small timestep of length ôt. It follows that the change in the value of the 
portfolio has two sources. 


(1) The asset price fluctuation. The change ôS; produces a change A;ôS; in the portfolio 
value. 

(2) Interest accrued on the cash deposit. Using the discrete version for convenience (see 
(2.7) in Exercise 2.2), we may write this contribution to the portfolio change as r D;ôt. 


Overall, 
ôI; = Ajo$; +r Diôt. (8.7) 


Now because V is assumed to be a smooth function of § and t, a Taylor series 
expansion gives 
OV; ð 


V; 02V; 
Vi & ôt E S 
at 5 as a as2 


oS (8.8) 


We have kept the 6S z term in (8.8) because experience from the previous two chap- 
ters suggests that it will make a contribution of size proportional to ôt. Subtracting 
(8.7) from (8.8) in order to compare the change in the portfolio with that in the 
option value, we find 


OV; OV; a°Vj 
ôV — I); ~ ( at = rs) ôt + ( 55 = Ai) oS; + 5 352 5S?, (8.9) 
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Our aim is to make the portfolio replicate the option, so that the difference 
between them is predictable. We can eliminate the unpredictable ôS; term from 
(8.9) by setting 


OV; 
Aj ; 8.10 
55 (8.10) 
in which case 
OV; a2V; 
ôV — I); © f= 7 De |) Fs 8.11 
V-m) (= ) +355 (8.11) 


The final step in eliminating randomness is to add these differences over 0 <i < 
L — 1 and exploit (8.4), which shows that the sum of the ôS 4 terms is nonrandom. 

Before proceeding with that final step, we pause to explain what (8.10) means 
in practice. If we are able to find the required function V, then we may differ- 
entiate it with respect to S in order to specify our strategy for updating the port- 
folio. At the end of the step from t; to ¢;;; we rebalance our asset holding to 
Aj+1 = 0V;+41/0S. This may involve selling (if 0Vi+1/0S < ƏV; /ƏS) or buying 
(if 0Aj41/0S > 0A;/0S) some amount of the asset. We want to make the portfolio 
self-financing, that is, beyond time t = 0 we do not want to add or remove money. 
This can be achieved by using the cash account to finance the update — the money 
needed for, or generated by, the asset rebalancing, is reflected by a corresponding 
change from D; to Di+1. This idea of continually fine-tuning the portfolio in order 
to reduce or remove risk is known as hedging. 


8.4 Black-Scholes PDE 
Letting A(V — IT) denote the change in V — II from time ż to t + Af, that is, 
AW -ID =V(S(¢ + At), ¢ + At) — (S(t + At), t + At) 
— WV (S(t), t) — TICS), t)), 
we may sum (8.11) to give 


aV; 
A(V —T) & 2o , pia iY east (8.12) 


i=0 


On the basis that V and D are smooth functions, we will replace the arguments 
S(t), ti in V;/dt, D; and 47V; /9S7, by S(t), t, ina similar manner to the approx- 
imation used for (8.1). So, using Lôt = At, 


av pervs 
A(V —T1) ® sD At+4 Fo Dask 
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Now, using (8.4), and assuming that all approximations are exact in the limit ôt > 
0, we may write 


av resco V 

The final leap of logic is to argue that because this change in the portfolio V — TI 
is nonrandom, it must equal the corresponding growth offered by the risk-free 
interest rate, so 


A(V =- T) =rAt(V - 0). (8.14) 


This follows from the no arbitrage principle. If A(V — II) > rAt(V — II) then 
we could make a guaranteed profit greater than that offered by the risk-free interest 
rate by 


(i) acquiring the portfolio V — IT at time rt — buying the option at V in the marketplace, 
and selling the portfolio IT (i.e. short selling A units of asset and loaning out an amount 
D of cash), and 

(ii) selling the portfolio V — TI at time t + At. 


Similarly, if A(V — II) < rAt(V — II) then we could make a guaranteed profit 
greater than that offered by the risk-free interest rate by 


(i) selling the portfolio V — T at time t — selling the option at V in the marketplace, and 
buying the portfolio II (i.e. buying A units of asset and borrowing an amount D of 
cash), and 

(ii) buying the portfolio V — TI at time t + At. 


Now, combining (8.6), (8.13) and (8.14) gives 


ave 1.2099°V 


Using A = 0V/0S from (8.10) and rearranging, we arrive at 
E ie E, (8.15) 
o r -rV =0. ; 
2 0S? as 


This is the famous Black-Scholes partial differential equation (PDE). It is a rela- 
tionship between V, S, ¢ and certain partial derivatives of V. 


Two points are worth raising immediately. 


(1) The drift parameter u in the asset model does not appear in the PDE. 

(2) We have not yet specified what type of option is being valued. The PDE must be sat- 
isfied for any option on § whose value can be expressed as some smooth function 
V(S, ¢). 
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Regarding point (2), to determine V (S, t) uniquely we must specify other condi- 
tions that involve information about the particular option. As is typical with many 
differential equations, these will apply somewhere along the edges of the domain 
0 < S,0 <t <T on which the problem is posed. 

We will use C (S, t) to denote the European call option value. In this case, we 
know for certain that at the expiry time, t = T, the payoff is max(S(T) — E, 0). 
This must be the value of the option at time T, otherwise an obvious arbitrage 
opportunity exists. So 


C(S, 7) = max(S(T) — E, 0). (8.16) 


Now if the asset price is ever zero, then it is clear from (6.9) that S(t) remains zero 
for all time and hence the payoff will be zero at expiry. So, in this case, the value 
of the option must be zero at all times. Hence, 


C(0,t)=0, forallO<t<T. (8.17) 


Conversely, if the asset price is ever extremely large, then it is very likely to remain 
extremely large and swamp the exercise price, so that, 


C(S,t) +S,  forlarge S. (8.18) 


The constraint (8.16) is called a final condition, as it applies at the final time 
t = T. It is much more common to come across initial conditions, specified at 
t = 0, and we will see in Chapter 24 that the PDE is easily transformed into such 
a problem. The other constraints, (8.17) and (8.18), are known as boundary condi- 
tions. 


8.5 Black-Scholes formulas 


Imposing (8.16), (8.17) and (8.18) on the Black-Scholes PDE (8.15) is enough to 
force a unique solution to exist for the call option value. (In fact we could get away 
with less boundary information, see Section 8.6.) This solution is 


C(S,t) =SN(d\) — Ee"? ON (d2), (8.19) 
where N (-) is the N(O, 1) distribution function, defined in (3.18), and 


d log(S/E) + (r + 50°)(T — t) 
1= 


2 
EIT ($20) 
il Am 
pE log(S/E) + r — 40°)(T D ror 


ONT -t 
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We may also write 
dy = dı —ovT —t, (8.22) 


see Exercise 8.2. The equation (8.19) displays the Black-Scholes formula for the 
value of a European call. It is possible to construct the formula by solving the PDE 
(8.15) under (8.16), (8.17) and (8.18). In this book, we take the easier route of 
verifying directly that C (S, t) in (8.19) has the right properties. Exercise 8.3 deals 
with (8.16), (8.17) and (8.18), and Section 10.4 deals with the PDE (8.15). 

Having obtained a formula for a European call option value, we may exploit 
put—call parity to establish the value P(S,¢) of a European put option. In Sec- 
tion 2.5 we derived the relation (2.2) that connects the time-zero call and put val- 
ues. Letting P (S,t) denote the put value at asset price S and time t, the same 
argument gives the general put—call parity relation 


C(S,t) + Ee’? = P(S,t)+ S, (8.23) 


see Exercise 8.4. Combining (8.19) and (8.23) leads to the Black-Scholes formula 
for the value of a European put option, 


P(S,t) = Ee’? (1 — N(da)) + S (N (di) — 1). 
Using Exercise 3.9, this may be simplified to 
P(S,t) = Ee! N (—d2) — SN(—d)). (8.24) 


Alternatively, we could derive final time and boundary conditions and attempt to 
solve the Black-Scholes PDE. Since the payoff for a put option at time t = T is 
max(E — S(T), 0), we have 


P(S,T) = max(E — S(T), 0). (8.25) 


If the asset price is ever zero then S(T) = 0 and the payoff at time T will be E. To 
obtain P (0, t) we discount for inflation, to get 


P(0,t) = Ee "T9, forallO0 <1¢ <T. (8.26) 
For extremely large S the payoff is almost certain to be zero, so 
P(S,t) +0, for large S. (8.27) 


Exercise 8.5 asks you to confirm that P(S, +t) in (8.24) satisfies the conditions 
(8.25)(8.27) and in Exercise 10.7 of Chapter 10 you are set the task of showing 
that it solves the Black-Scholes PDE (8.15). 


Computational example For illustration, we give a simple example of evalu- 
ating the Black-Scholes formulas. With t = 0, So = 5, E =4,T =1,0 =0.3 
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andr = 0.05, we find, to four decimal places, 


dı = 1.0605, 

d2 = 0.7605, 

N (dı) = 0.8555, 
N (d2) = 0.7765, 
N (—d;) = 0.1445, 
N (—d2) = 0.2235. 


Here, we used MATLAB’s erf function in order to evaluate N (x) — see Exer- 
cise 4.1. The resulting European call and put option values are 


C(5,0) = 1.3231 and P(5,0) = 0.1280. 


The put—call parity relation (2.2) is easily confirmed. © 


8.6 Notes and references 


The two classic references for the Black—Scholes theory are the paper (Black and 
Scholes, 1973) by Fischer Black and Myron S. Scholes, which derives the key 
equations, and the paper (Merton, 1973) by Robert C. Merton, which adds a rig- 
orous mathematical analysis. Merton and Scholes were awarded the 1997 Nobel 
Prize in Economic Sciences for this work. It is widely accepted that Fischer Black, 
who died in 1995, would have shared in the prize had he still been alive. Details of 
the prize can be found at www.nobel.se/economics/laureates/1997/. 

The accompanying press release argues that 


A new method to determine the value of derivatives stands out 
among the foremost contributions to economic sciences 
over the last 25 years. 


The heuristic, discrete-time treatment of hedging that we used to derive the 
Black-Scholes PDE was inspired by the expository article of Almgren (Almgren, 
2002). Modern texts that give rigorous derivations of the Black-Scholes formula 
include (Björk, 1998; Duffie, 2001; Karatzas and Shreve, 1998; Nielsen, 1999; 
Øksendal, 1998). 

It is possible to weaken the boundary conditions (8.17) and (8.18) in the Black— 
Scholes PDE (8.15) without sacrificing uniqueness of the solution. Some control 
on the growth of the solution as $ — oo would suffice, see for example (Wilmott 
et al., 1995). We will return to the issue of boundary conditions when we discuss 
finite difference methods in Chapters 23 and 24. 
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As a final comment, we note that although the time-T call value is a nonsmooth 
hockey stick, (8.16), the function C(S,¢) is smooth at all times 0 < t < T; this 
phenomenon of ‘instant smoothing’ is typical of diffusion PDEs like (8.15). 


8.1. 


8.2. 


8.3. 


8.4. 


8.5. 


8.6. 


8.7. 


8.8. 


8.9. 


8.10. 


8.11. 


EXERCISES 


xx Show that (8.2) leads to the approximate relation (8.3). [Hint: use Exer- 
cise 3.7.] 

x Show that (8.21) can be replaced by (8.22). 

x x x Confirm that C (S, t) in (8.19) satisfies (8.16), (8.17) and (8.18). [Hint: 
to deal with (8.16), take the limit t —> T7, to deal with (8.17) take the limit 
S — 07 and to deal with (8.18) take the limit S > œ.] 

xx Use the argument in Section 2.5 to obtain the general put—call parity 
relation (8.23). 

x x x Confirm that P (S, t) in (8.24) satisfies (8.25)—(8.27). 

xx It is intuitively obvious that call and put options are linear — the value of 
two options is twice the value of one option. Show how this follows from 
the Black-Scholes formulas (8.19) and (8.24). 

x x x Show that limg_-.9 C(S, t) = S in (8.19) and limg-_,9 P (S, t) = 0 in 
(8.24), and give a financial interpretation of the results. 

x x x Write down a PDE and final time/boundary conditions for the value 
of a butterfly spread, as described in Exercise 1.3. 

x x x Verify that 


e(o°—2r)(T—1) 
S 


is a solution of the Black-Scholes PDE (8.15). What is the practical impli- 
cation of this result? 

x x x Verify that S and e” are solutions of the Black-Scholes PDE (8.15) 
and give an accompanying financial explanation. 

x x x Consider the problem posed in Exercise 2.6 of finding a fair value for 
a forward contract. Use Exercise 8.7 above to confirm that F = S(O)e’?. 


V(S,t) = 


8.7 Program of Chapter 8 and walkthrough 


Unlike the previous seven cases, our code for this chapter, which is listed in Figure 8.2, is a MATLAB 


function. This means that it must be supplied with input arguments and it will return output argu- 


ments. The input arguments S,E,r,sigma and tau represent, respectively, the asset price at time f, 


the exercise price, the interest rate, the volatility and the time to expiry, T — t. It is assumed that tau 


is non-negative. 
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fu 
% 
% 
% 
% 
% 
% 
% 
% 
% 
% 
% 
% 
% 


if 


el 


nction [C, Cdelta, P, Pdelta] = ch08(S,E,r,sigma,tau) 
Program for Chapter 8 
This is a MATLAB function 


Input arguments: S = asset price at time t 
E = Exercise price 
r = interest rate 
sigma = volatility 
tau = time to expiry (T-t) 


Output arguments: C = call value, Cdelta = delta value of call 
P = Put value, Pdelta = delta value of put 


function [C, Cdelta, P, Pdelta] = ch08(S,E,r,sigma,tau) 


tau > 0 

d1 = (log(S/E) + (r + 0.5*sigma’2)*(tau))/(sigma*sqrt(tau)); 
d2 = d1 - sigma*sqrt(tau); 

N1 = 0.5*(1+erf(d1/sqrt(2))); 
N2 = 0.5*(1+erf(d2/sqrt(2))); 
C = S*N1-E*exp(-r*(tau))*N2; 
Cdelta = N1; 

P = C + E*exp(-r*tau) - S; 
Pdelta = Cdelta - 1; 

se 

C = max(S-E,0); 

Cdelta = 0.5*(sign(S-E) + 1); 
P = max(E-S,0); 

Pdelta = Cdelta - 1; 


end 


Fig. 8.2. Program of Chapter 8: ch08.m. 


The output arguments C,Cdelta,P and Pdelta represent, respectively, the European call, call 


delta, put and put delta values. 


to 


N 


Here, we are at expiry and to avoid division by zero errors in (8.20) and (8.22), we revert to the 
expressions (8.16), (8.25), along with (9.7) and (9.8) from Chapter 9. We make use of the signum 


fu 


The lines of code between if tau > O and else are executed in the case where tau, the time 
expiry, is positive. In this case we are evaluating the Black-Scholes values given by (8.19), (8.24), 
and also the deltas (9.1) and (9.2) that are introduced in Chapter 9, using erf as a means to obtain 


(x), as described in Exercise 4.1. 


The lines of code between else and end are executed in the remaining case, where tau is zero. 


nction, sign, which is defined by 


1, ifx > 0, 
sign(x) = 0, if x = 0, 
—l, ifx <0. 
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An example of the function in use is 


>> S = 2; E = 2.5; r = 0.03; sigma = 0.25; tau = 1; 
>> [C, Cdelta, P, Pdelta] = ch08(S,E,r,sigma,tau) 


which outputs 


C = 0.0691 
Cdelta = 0.2586 
P = 0.4953 


Pdelta = -0.7414 


PROGRAMMING EXERCISES 


P8.1. Use ch08.m to produce graphs illustrating the limits lim,_,7- C (S, t) = 
max(S (T) — E, 0) and lims_.o0 C (S, t) = S established in Exercise 8.3. 
P8.2. Write a program that illustrates (8.4) in the style of Figure 8.1. 


Quotes 


Stephen Belloti: “Myron, what do you have more of — money or brains?’ 
Myron Scholes: ‘Brains, but it’s getting close.’ 
Source (Lowenstein, 2001) 


In the early 1970s, Merton tackled a problem 
that had been partially solved by two other economists, 
Fischer Black and Myron S. Scholes: 
deriving a formula for the ‘correct’ price of a stock option. 
Grasping the intimate relation between an option and the underlying stock, 
Merton completed the puzzle with an elegantly mathematical flourish. 
Then he graciously waited to publish until after his peers did; 
thus the formula would ever be known as the Black-Scholes model. 
Few people would have cared given that no active market for options existed. 
But coincidentally, a month before the formula appeared, 
the Chicago Board Options Exchange had begun to list stock options for trading. 
Soon, Texas Instruments was advertising in The Wall Street Journal, 
‘Now you can find the Black-Scholes value using our ... calculator.’ 
This was the true beginning of the derivatives revolution. 
Never before had professors made such an impact on Wall Street. 
ROGER LOWENSTEIN (Lowenstein, 2001) 


In 1975 I crammed the Black-Scholes formula into a TI-52 handheld calculator, 
which was capable of giving me one option price in about thirteen seconds. 
It was pretty crude, 
but in the land of the blind I was the guy with one eye. 
JOE RITCHIE, option trader, source (Bass, 1999) 
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To someone who came out of graduate school in the mid-eighties, 
the decade spanning roughly 1969-79 
seems like a golden age of dynamic asset pricing theory ... 
The Black-Scholes model now seems to be, by far, 
the most important single breakthrough of this ‘golden decade’ ... 
Theoretical developments in the period since 1979, with relatively few exceptions, 
have been a mopping-up operation. 
DARRELL DUFFIE (Duffie, 2001) 


9 
More on hedging 


OUTLINE 


e practical illustration of hedging 
e behaviour of delta near expiry 
e Long-Term Capital Management 


9.1 Motivation 


The hedging idea that was used to derive the Black-Scholes PDE forms the most 
important concept in this book. In this chapter, we therefore take time out to re- 
iterate the steps involved and develop the process into an algorithm that can be 
illustrated numerically. 


9.2 Discrete hedging 


Having found the explicit formulas (8.19) and (8.24), we may differentiate with 
respect to S to obtain the required asset holding A; in (8.10). This partial derivative 
dV /dS is called the delta of an option, and the hedging strategy that we discussed 
is known as delta hedging. Performing the differentiation leads to 


aC 
aS = N (dı) (delta of a European call), (9.1) 
and 
oP 
aS = N(dı)— 1 (delta of a European put). (9.2) 


Confirmation of these expressions is deferred until Chapter 10, where various par- 
tial derivatives are computed. 

Returning to the delta hedging process, we know from (8.7) that I;+1, the value 
of the portfolio at t; + ôt, satisfies 


IM; = AjSj41 + A +r dt)Dj. (9.3) 
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The asset holding is rebalanced to Aj; and in order to compensate, the cash ac- 
count is altered to Dj). Since no money enters or leaves the system, the new 
portfolio value, Aj+1S;+,; + Dj+1, must equal IT;+1 in (9.3), so 


Dj41 = (1 +rôt)Di + (Ai — Ai+1)Si+1- (9.4) 


We may summarize the overall hedging strategy as follows. 


Set Ap = 0Vo/0S, Do = 1 (arbitrary), Ilo = AoSo + Do 
For each new time t = (i + 1)ét 
Observe new asset price $;+1 
Compute new portfolio value IT;+1 in (9.3) 
Compute Aj+1 = ove 
Compute new cash holding D;+, in (9.4) 
New portfolio value is Aj+15j;41 + Dj+1 


end 


More precisely, this strategy is discrete hedging as the rebalancing act is done 
at times iôt. Because we cannot let ôt — 0 in practice, there will be some error in 
the risk elimination. 

For the purpose of illustration, it is possible to simulate an asset path and im- 
plement discrete hedging. To write down the resulting algorithm, we use {&;} to 
denote samples from an N(0, 1) pseudo-random number generator that are used in 
simulating the asset path, and we let ôt = T/N. 


Set Ap = 0Vo0/0S, Do = 1 (arbitrary), Ilo = AoSo + Do 
Fori = 0toN — 1 
Compute $;41 = Spee 20 )8t4 VBI 
Set Tj41 = Ai Si+1 + 1 +rôt)Di 
Compute Aj41 = it 
Set Di41 = (1 + rdt)D; + (Ai — Ai+1)Si41 


end 


To describe the next set of experiments, it is convenient to use some financial 
jargon. At time f, a European call option is said to be 


in-the-money if S(t) > E, 
out-of-the-money if S(t) < E, and 
at-the-money if S(t) = E. 


The jargon extends in an obvious fashion to other options. In general, in-the-money 
means that there will be a positive payoff if the asset price stays as it is. Out-of- 
the-money means that the asset must change by some non-negligible amount in 
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order for a positive payoff to ensue. At-the-money defines the boundary between 
in- and out-of-the-money. 


Computational example Here we implement the discrete hedging simulation 
above for a European call option with Sg = 1, E = 1.5, u = 0.055, r = 0.05, 
T =5 and ôt = 10-7, so N = 500. The upper plot in Figure 9.1 displays the 
particular discrete asset path (t;, S;), for t; = iôt, that arose. The strike price 
E is shown as a dashed line. We see that for this particular asset path, the call 
option stays out-of-the-money (asset price below E) until just after t = 1, and 
then makes a number of excursions in/out-of-the-money before giving a very 
small payoff at expiry. The upper-middle plot shows the deltas, (t;, 0C;/0S), 
along the asset path. This shows the time-varying amount of asset held in the 
portfolio. The lower-middle plot gives the cash level (¢;, D;) and the solid curve 
in the lower plot gives the portfolio value (¢;, T;). The idea behind delta hedging 
is to guarantee that the portfolio C — II grows at the risk-free interest rate. It 
follows that 


T(S), © = CS), t) — (C(So, 0) — H(So, 0)) e” (9.5) 


should hold. To test this, we computed the right-hand side of (9.5) at each time t;, 
using the Black-Scholes formula (8.19) to compute C (S;, t;). Every tenth value 
has been plotted as a circle in the lower picture.' The circles appear to lie on top 
of the IT; curve, so (9.5) is approximated well. The discrepancy in (9.5) at the 
expiry date, 


C(S(T), T) — (S(T), T) — (C (So, 0) — T(So, 0) e7], (9.6) 


was found to be 0.0364. Reducing ôt to 1074 (and hence computing a different 
asset path), we found that this discrepancy was lowered to 0.0029. © 


Computational example In Figure 9.2 we repeat the computation in Figure 9.1 
with E set to the value 2.5. In this case the option finishes out-of-the-money. 
Again we observe from the lower picture that (9.5) is close to being exact. <> 


9.3 Delta at expiry 
Looking carefully at Figures 9.1 and 9.2 we see that 


e inthe first experiment, where the option expires in-the-money, the delta approaches the 
value | at expiry, whereas 


1 Plotting every value would make the picture too cluttered. 
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Fig. 9.1. Discrete hedging simulation. Option expires in-the-money. Upper: dis- 
crete asset path. Upper-middle: delta values (also asset holding in portfolio). 
Lower-middle: cash holding in portfolio. Lower: portfolio value (solid), theoreti- 
cal portfolio value (9.5) (circles). 


e in the second experiment, where the option expires out-of-the-money, the delta ap- 
proaches the value 0 at expiry. 


This is no accident. Using the characterization (9.1), some analysis shows that 


1, if S(T) > E, 
aC (S,t 
lim ce =41, if S(T) =E, (9.7) 
E 0, if S(T) < E, 


see Exercise 9.3. Hence, the delta always finishes at 1 for options that expire in-the- 
money and 0 for options that expire out-of-the-money. If S(t) ~ E for times close 
to expiry, then the delta is liable to swing wildly between values at ~ 1 (when S(t) 
goes above E) and ~ 0 (when S(t) dips below E). Our next experiment illustrates 
this effect. 


Computational example Here we repeat the computation that produced 
Figures 9.1 and 9.2 with the strike price reset to E = 1.9, so that the option 
frequently jumps in/out-of-the-money near expiry. Figure 9.3 shows that the cor- 
responding delta value lurches dramatically as expiry is approached. © 
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Portfolio 


Fig. 9.2. Discrete hedging simulation. Option expires out-of-the-money. Upper: 
discrete asset path. Upper-middle: delta values (also asset holding in portfolio). 
Lower-middle: cash holding in portfolio. Lower: portfolio value (solid), theoreti- 
cal portfolio value (9.5) (circles). 


The delta behaviour near expiry that was observed in Figures 9.1 to 9.3, and is 
encapsulated in (9.7), has a simple financial interpretation. For t ~ T there is little 
time left for the asset value to change — if it is currently in/out-of-the-money then it 
will probably remain in/out-of-the-money. In particular, if the call option is in-the- 
money then any upward or downward movement in the asset corresponds almost 
directly to the same upward or downward movement in the payoff. In other words, 
the call option and the asset are very highly correlated — they share the same risk. 
Since the portfolio is designed to replicate the risk in the option, it follows that it 
will hold approximately 1 unit of asset, so A; œ% 1. Conversely, if the call option is 
out-of-the-money close to expiry then the payoff is very likely to be zero whatever 
happens to the asset — there is no risk, so we should not be holding any asset. 

The analogous results to (9.7) for a European put option are 


0, if S(T) > E, 
aP(S,t 

lim oe = 4-4, if S(T) = E, (9.8) 
Ea -1,  ifS(T)<E, 


see Exercise 9.4, and a similar financial argument applies, see Exercise 9.5. 
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Fig. 9.3. Discrete hedging simulation. Option expires almost at-the-money. Up- 
per: discrete asset path. Upper-middle: delta values (also asset holding in port- 
folio). Lower-middle: cash holding in portfolio. Lower: portfolio value (solid), 
theoretical portfolio value (9.5) (circles). 


9.4 Large-scale test 


We finish with an experiment that looks at the success of discrete hedging over a 
large number of sample paths, and also illustrates that the option value is indepen- 
dent of the drift parameter, jz, in the asset price model. 


Computational example Here we take a European put option with Sọ = 5, E = 
5, r = 0.05 and o = 0.3, with T = 3. We computed 500 discrete asset paths 
with time-spacings ôt = 10-7. The upper picture in Figure 9.4 plots S(T) on the 


horizontal axis against 
T1(S(T), T) + (P (So, 0) — T1(So, 0)) "7 (9.9) 


on the vertical axis for the case u = 0.2. There are 500 such points, one for 
each asset path. We computed P (Sg, 0) in (9.9) from the Black-Scholes formula 
(8.24). If the discrete hedging is successful, then an analogous identity to (9.5) 
holds for P (S(t), t). In particular, it holds at expiry, so (9.9) should agree with 
the put payoff max(E — S(T), 0). This ‘hockey stick’ payoff curve is superim- 
posed as a dashed line. We see that the dots lie close to the dashed line, and 
hence the discrete hedging algorithm behaves as predicted. The lower picture in 
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15 
S(T) 
Fig. 9.4. Large-scale discrete hedging example for a European put. Dots repre- 


sent normalized final payoff (9.9) for 500 asset paths. Exact hockey stick payoff is 
superimposed as a dashed line. Upper picture, u = 0.2. Lower picture, u = 0.4. 


Figure 9.4 shows the same computations with u changed to 0.4. This illustrates 
the phenomenon that the option value does not depend upon jz. ° 


9.5 Long-Term Capital Management 


There are many instances of academics with an expertise in mathematical finance 
turning their hands to real-life trading. The most high-profile and, ultimately, 
sobering example involves Long-Term Capital Management (LTCM). This was 
a hedge fund that invested money supplied by its partners and a limited number of 
wealthy clients. Two of the partners, closely involved in day-to-day trading strate- 
gies, were Robert Merton and Myron Scholes — founding fathers of the ‘rocket 
science’ of option valuation theory. The fund, set up in 1994, was extremely suc- 
cessful at raising capital and for a period of around four years produced impres- 
sively high returns. Although sometimes referred to as an arbitrage unit, LTCM 
typically scoured the international markets looking for low risk opportunities to 
make relatively small percentage gains. The fund used leverage — investing bor- 
rowed money — to scale up these tiny margins into large profits. One commentator 
likened their trades to ‘picking up nickels in front of bulldozers’ (Lowenstein, 
2001, page 102). At the peak of the fund’s success, Merton and Scholes received 
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their Nobel Prizes. However, in mid-1998 a combination of extreme events in the 
market plunged LTCM into deep trouble. One of the key difficulties they then 
faced was illiquidity. LTCM became desperate to offload a vast range of com- 
plicated portfolios, but the small set of potential buyers were, quite reasonably, 
holding out in the expectation that prices would drop further. (The assumption of 
liquidity — there always being a ready supply of buyers and sellers — is implicit in 
the Black-Scholes theory.) The bulldozers were moving in. The decline of LTCM 
and the enormity of its potential debts were brought to the attention of The Fed- 
eral Reserve Bank of New York (the Fed), a major component of the US Federal 
Reserve System. Quite remarkably, the Fed became concerned that bankruptcy of 
LTCM could create such a hole that the overall stability of the market was at threat. 
Very rapidly, the Fed managed to persuade a consortium of major banks and in- 
vestment houses to bail out LTCM in order to prevent the very real possibility of a 
total meltdown of the financial system.! 

Overall, a dollar invested in LTCM grew to a height of around $2.85, but 
dropped sharply to a paltry 23 cents, and the partners lost personal fortunes. A 
fast-paced and highly informative account of the LTCM debacle, with input from 
a number of first-hand witnesses, is given in (Lowenstein, 2001). 


9.6 Notes 


If you understand the hedging idea, it is perfectly reasonable for you to ask why 
options exist, that is, 


given that it is possible to reproduce the payoff of an option using only cash and the under- 
lying asset, why is there a market for options? 


One answer is that the Black-Scholes theory relies on assumptions that are not 
universally valid, and it is neither convenient nor feasible for most of us to carry 
out hedging. On one side there is a large group of investors who view options as 
an excellent means to alleviate their exposure to risk, and another large group who 
see options as a great way to speculate on the market. On the other side there is a 
complementary group of well-connected players, with the resources to manipulate 
complicated portfolios and negotiate relatively small transaction costs, who are 
willing to accept the Black-Scholes value plus a small premium. 


EXERCISES 
9.1. x Show from (9.1) and (9.2) that 0C/dS > OanddP/dS < 0. 


! Lowenstein (Lowenstein, 2001, page 198) quotes Sandy Warner from J. P. Morgan: ‘Boys, we’re going to a 
picnic and the tickets cost $250 million’. 
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%CH09 Program for Chapter 9 

% 

% Illustrates delta hedging by computing an approximate 

% replicating portfolio for a European call 

% 

% Portfolio is ‘asset’ units of asset and an amount ‘cash’ of cash 
% Plot actual and theoretical portfolio values 


randn(‘state’,100) 
clf 


APAPAP” A” APA” Problem parameters %%V%V%VWVWV%% 
Szero = 1; sigma = 0.35; r = 0.03; mu = 0.02; T = 5; E = 2; 

Dt = 1e-2; N = T/Dt; t = [0:Dt:T]; 

Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo ooo PP A% 


S = zeros(N,1); asset = zeros(N,1); cash = zeros(N,1); 
portfolio = zeros(N,1); Value = zeros(N,1); 


[C,Cdelta,P,Pdelta] = ch08(Szero,E,r,sigma, T-t(1)); 


S(1) = Szero; 

asset(1) = Cdelta; 

Value(1) = C; 

cash(1) = 1; 

portfolio(1) = asset(1)*S(1) + cash(1); 


fori=1:N 
S(i+1) = S(i)*exp((mu-0.5*sigma’2)* Dt+sigma*sqrt(Dt)*randn); 
portfolio(i+1) = asset(i)*S(i+1) + cash(i)*(1+r*Dt); 
[C,Cdelta,P Pdelta] = ch08(S(i+1),E,r,sigma,T-t(i+1)); 
asset(i+1) = Cdelta; 
cash(i+1) = cash(i)*(1+r*Dt) - S(i+1)*(asset(i+1) - asset(i)); 
Value(i+1) = C; 

end 


Vplot = Value - (Value(1) - portfolio(1))*exp(r*t)’; 
plot(t(1:5:end),Vplot(1:5:end),’bo’) 

hold on 
plot(t(1:5:end),portfolio(1:5:end),’r-’,/LineWidth’,2) 
xlabel(’Time’), ylabel(’Portfolio’) 
legend(’Theoretical Value’,/Actual Value’) 

grid on 


Fig. 9.5. Program of Chapter 9: ch09 .m. 
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9.2. xx By making reference to the limit definition 


aC . (cee) 
= lim 55 : 


as. 6S>0 

give an intuitive reason why 9C /dS > 0. Do the same for 0P/dS < 0. 

9.3. xx Using the expression (9.1), confirm the limiting behaviour for 
dC (S, t)/dS displayed in (9.7). 

9.4. xx Using the expression (9.2), confirm the limiting behaviour for 
dP(S,t)/dS displayed in (9.8). 

9.5. xx Give a financial argument that explains why 3P (S, t)/dS — —1 at expiry 
for an in-the-money put option and 0P(S,t)/dS — 0 at expiry for an out- 
of-the-money put option. 


9.7 Program of Chapter 9 and walkthrough 


Our program ch09 implements a discrete hedging simulation and produces a picture like the lower 
plots in Figures 9.1—9.3. It is listed in Figure 9.5. Here, S, asset, Value and cash are N by 1 arrays 
whose ith entries store the asset price, asset holding, Black-Scholes option value and cash holding at 
time t (i), respectively. After initializing parameters, we set up a for loop that updates the portfolio 
as described in Section 9.2. The Black-Scholes function ch08 from Chapter 8 is used to find the 
option value and the delta. 

On exiting the loop we superimpose the left- and right-hand sides of (9.5), plotting at every fifth 
time point. 


PROGRAMMING EXERCISES 


P9.1. Adapt chO9.m to investigate how the average discrepancy at expiry, (9.6), 
varies as a function of ôt. 


P9.2. Perform a large-scale test for a call option in the style of Figure 9.4. 


Quotes 


The professors were brilliant at reducing a trade to pluses and minuses; 
they could strip a ham sandwich to its component risks; 
but they could barely carry on a normal conversation. 
ROGER LOWENSTEIN (Lowenstein, 2001) 


After closing about 200 000 option transactions 
(that is separate option tickets) 
over 12 years and studying about 70 000 risk management reports, 
I felt that I needed to sit down and reflect on the thousands 
of mishedges I had committed. 
NASSIM TALEB (Taleb, 1977) 
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It is probably safe to say that 

the derivatives industry would be stuck in the psychedelic 60s, 

and many talented mathematicians would still be 

teaching freshman algebra for $20,000 a year 

had Black, Scholes, and Merton not made their contribution. 

DON M. CHANCE, ‘Rethinking Implied Volatility’ Financial Engineering News, January/ 
February 2003. 


10 
The Greeks 


OUTLINE 


e formulas for the Greeks 
e financial interpretations 
e confirmation that the Black-Scholes PDE is solved 


10.1 Motivation 


The Black-Scholes option valuation formulas (8.19) and (8.24) depend upon S, t 
and the parameters E, r and ø. In this chapter we derive expressions for partial 
derivatives of the option values with respect to these quantities. These results are 
useful for a number of reasons. 


Traders like to know the sensitivity of the option value to changes in these quantities. 
The sensitivities can be measured by these partial derivatives; see Exercise 10.1. 
Computing the partial derivatives allows us to confirm that the Black-Scholes PDE has 
been solved. 

Examining the signs of the derivatives gives insights into the underlying formulas. 

The derivative dV /dS is needed in the delta hedging process. 

The derivative ƏV /do0 comes into play in Chapter 14, where we compute the implied 
volatility. 


We focus on the case of a call option. Exercise 10.7 asks you to do the same 


things for a put. 


10.2 The Greeks 


Certain partial derivatives of the option value are so widely used that they have 
been assigned Greek names and symbols, ! 


aC 


A:= — 
chy 


(delta), 


1 Vega is not actually a Greek name, and does not even qualify for a symbol. 
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a2 

T := ae (gamma), 
aC 

p= 27 (rho), 
or 
aC 

O := — (theta), 
ot 
aC 

vega := — (vega). 

do 


By differentiating C in (8.19), using (8.20) and (8.21), it is possible to find 
explicit expressions for these quantities. Before launching into this process we 
make note of two useful facts. First, it follows from (3.18) that 


1 2 
N'(x) = a hai 
J 250 
Our second fact, 
SN' (di) — e 7! EN (do) =0, (10.1) 


is to be proved in Exercise 10.2. 
Differentiating with respect to S in (8.19) we have 
dd, sf = dd 
A = N(d\) + SN'(d\) = — Ee"! ON" (dg) — 
(dı) + =e e (d) ra 
N'(d l N'(d 
a = gee 
oVT -t SoVT -t 
Appealing to (10.1), we see that the second and third terms on the right-hand side 
cancel, giving 


A = N(d)), (10.2) 


a result that we used in Chapter 9. We then have 


0A dd N'(d 
pe aa a Se. (10.3) 
as as SoJ/T —t 
Next, differentiating C with respect to r we find that 
aC dd . . dd: 
ON Gp + (T = Hho ON SEE ON (d) — 
or or or 
= Che + (T —t)Ee?’? ON) 
ONT —t 


T-t 
— Ee T- N' (dy) ——_.. 
MA 
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As before, (10.1) allows us to cancel terms, and we find that 
p = (T —t)Ee"! ON (dy). (10.4) 


Similar analysis shows that 


e= D iss —rEe™ TÀN (dy) (10.5) 
IIT =F 


and 
vega = S/T — tN' (dı), (10.6) 
see Exercises 10.3 and 10.4. 


10.3 Interpreting the Greeks 


It is possible to interpret some of the Greek formulas from a financial viewpoint 
and to check that they agree with intuition. 

First we recall that the limiting behaviour of delta was characterized and inter- 
preted in Section 9.3. We also know from Exercise 9.1 that A > 0 up to expiry. 
This makes sense, because an increase in the asset price increases the likely profit 
at expiry. 

From (10.4) we see that p > 0 before expiry. To explain this we note that in- 
creasing the interest rate is equivalent to lowering the exercise price E. (The value 
of a fixed amount E at some fixed time in the future becomes less if the interest 
rate increases.) This makes a payoff more likely, which increases the value of the 
option. 

The expression (10.5) shows that © < 0. This property could also be deduced 
directly from the general, asset-model-independent argument in Section 2.6 con- 
cerning the monotonicity of the time-zero call option value with respect to the 
expiry date, see Exercise 10.5. 

The vega in (10.6) is always positive before expiry. This can be understood by 
considering that an increase in volatility leads to a wider spread of asset prices. 
However, assets moving deeper out-of-the-money have no effect on the option 
price (the payoff remains zero) while assets moving deeper into-the-money lead to 
a greater payoff. Because of this asymmetry, increasing o has a net positive effect. 
We return to vega in Chapter 14. 


10.4 Black-Scholes PDE solution 


Having worked out the partial derivatives, we are in a position to confirm that 
C(S,t) in (8.19) satisfies the Black-Scholes PDE (8.15). Using our expressions 
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for A, T, p and ©, we have 
IC ıı 3?C aC —So ers 
S? eS — — rC = ———N' (d1) —r Eel" ONG 
ar oe qg2 T! 55 1 Niner (d,;) —rEe (d2) 
N'(dı) 
+ $o? s? +rSN(d 
2 So/T —t (di) 
=f (SN) = Ee" T-N (d)) 
= 0. 


10.5 Notes and references 


Many texts present the formulas for the Greeks without getting into the nitty-gritty 
of differentiation. Exceptions are (Kwok, 1998; Nielsen, 1999). For more infor- 
mation on interpreting the Greek formulas, see (Hull, 2000; Kwok, 1998; Nielsen, 
1999), for example. 


10.1. 


10.2. 


10.3. 
10.4. 


10.5. 


10.6. 


10.7. 


EXERCISES 


x If F :R— R is differentiable, use the definition of the differentiation 
process to explain why F’(x) measures the sensitivity of F to changes in 
x. 

xx Verify the identity 


lo SNAG) =0 
8 e~ T-)EN'(d2) pre 


and hence derive (10.1). 

x x x Establish (10.5) and (10.6). 

xx Give a financial explanation why A < 0 for a put option (proved in 
Exercise 9.1). 

xx Show that the condition dC /dt < 0 can be deduced directly from the 
conclusion in Section 2.6 that the time-zero call option value is a non- 
decreasing function of the expiry date. 

x x x Using (10.1), show that the partial derivative dC /dE (which, sadly, 
does not have a Greek name) satisfies 


aC —r(T—t) 

es N (d2). 

JE á (42) 
Deduce that dC /dE < 0 and interpret this result. 


xxx Using the put—call parity identity (8.23), for each expression for a 
partial derivative of C that appears in this chapter obtain an expression 
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function [C, Cdelta, Cvega, P, Pdelta, Pvega] = ch10(S,E,r,sigma,tau) 
% Program for Chapter 10 
% This is a MATLAB function 


% 


% Input arguments: S = asset price at time t 


% 
% 
% 
% 
% 


E = exercise price 

r = interest rate 

sigma = volatility 

tau = time to expiry (T-t) 


% Output arguments: C = call value, Cdelta = delta value of call 


% 
% 
% 
% 


Cvega = vega value of call 
P = Put value, Pdelta = delta value of put 
Pvega = vega value of put 


% function [C, Cdelta, Cvega, P, Pdelta, Pvega] = ch10(S,E,r,sigma,tau) 


if tau > 0 
d1 = (log(S/E) + (r + 0.5*sigma‘’2)*(tau))/(sigma*sqrt(tau)); 
d2 = d1 - sigma*sart(tau); 
N1 = 0.5*(1+erf(d1/sqrt(2))); 
N2 = 0.5*(1+erf(d2/sqrt(2))); 
C = S*N1-E*exp(-r*(tau))*N2; 
Cdelta = N1; 
Cvega = S*sqrt(tau)*exp(-0.5*d1°2)/sqrt(2* pi); 
P = C + E*exp(-r*tau) - S; 
Pdelta = Cdelta - 1; 
Pvega = Cvega; 


else 
C = max(S-E,0); 
Cdelta = 0.5*(sign(S-E) + 1); 
Cvega = 0; 
P = max(E-S,0); 
Pdelta = Cdelta - 1; 
Pvega = 0; 
end 


Fig. 10.1. Program of Chapter 10: ch10.m. 


for the corresponding partial derivative of P. For each discussion of the 
sign of a partial derivative of the call option value, give a discussion of the 
corresponding sign for the put. In particular, show by example that ðP /dt 
may be positive or negative. Using your expressions, confirm that P (S, t) 
satisfies the Black-Scholes PDE (8.15). 
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10.6 Program of Chapter 10 and walkthrough 


The program ch10, listed in Figure 10.1, is an extended version of the function ch08 that returns 
values of the call and put vega. These values will be needed by the program ch14 in Chapter 14. The 
call vega formula is given by (10.6) and the put vega formula was derived in Exercise 10.7. 

An example of the function in use is 


>> S$ = 2; E = 2.5; r = 0.03; sigma = 0.25; tau = 1; 
>> [C, Cdelta, Cvega, P, Pdelta, Pvega] = ch10(S,E,r,sigma,tau) 


which outputs 


C = 0.0691 
Cdelta = 0.2586 
Cvega = 0.6470 

P = 0.4953 
Pdelta = -0.7414 
Pvega = 0.6470 


PROGRAMMING EXERCISES 


P10.1. Adapt function ch10.m to return more Greeks. 


P10.2. Investigate the use of MATLAB’s symbolic toolbox to confirm the results 
in this chapter. 


Quotes 


Proof: Use the Black-Scholes formula (6.46) and take derivatives. 
The (brave) reader is invited to carry this out in detail. 
The calculations are sometimes quite messy. 
THOMAS BJORK (on calculating the Greeks) (Bjork, 1998) 


Iam so glad I am a Beta, 
the Alphas work so hard. 
And we are much better than the Gammas and Deltas. 
ALDOUS HUXLEY from Brave New World, 1932 (1894-1963) 


You can overintellectualize these Greek letters. 
One Greek word that ought to be in there is hubris. 
DAVID PFLUG, source (Lowenstein, 2001) 


Neither Black nor Scholes, 
at first, 
knew how to derive the solution to these complicated equations, ... 
MARK. P. KRITZMAN (Kritzman, 2000) with reference to the Black-Scholes PDE 
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More on the Black—Scholes formulas 


OUTLINE 


e irrelevance of the asset growth rate 
e behaviour as time increases 

e Black-Scholes surfaces 

e re-scaling the formulas 


11.1 Motivation 


We now take the opportunity to reflect a little more on the Black-Scholes option 
valuation formulas. In particular, Figure 11.3 is an attempt to squeeze everything 
we have learnt into a single picture. 


11.2 Where is u? 


The Black-Scholes formulas allow us to determine a fair price at time zero for 
a European call or put option in terms of the initial asset price, Sọ, the exercise 
price, E, the asset volatility, o, the risk-free interest rate, r, and the expiry date, 
T . Each of these quantities is known, with the exception of the asset volatility, o. 
Chapters 14 and 20 are concerned with the task of estimating o using information 
available from the market. A big surprise, and perhaps the most remarkable aspect 
of the Black-Scholes theory, is that the option price does not depend on the drift 
parameter, u, which, from (6.11), determines the expected growth of the asset. A 
consequence is that two investors could have wildly different views about what is 
an appropriate value of u for a particular asset and yet, if they agreed on the volatil- 
ity and accepted the assumptions that go into the Black-Scholes analysis, they 
would come up with the same value for the option. This phenomenon, which may 
seem highly questionable at first glance, is a consequence of the fact that Black- 
Scholes determines a fair value for the option — a value that can be recovered 
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using the risk-free delta hedging strategy and hence the value, in the presence of 
arbitrageurs, that the forces of supply and demand dictate for the market. 
Suppose that there are two speculators, 


e Speculator A, who believes that the asset price will follow (6.9) with drift ua and volatil- 
ity o, and 

e Speculator B, who believes that the asset price will follow (6.9) with drift ug and volatil- 
ity o. 


Suppose the speculators wish to take a naked, long position on a European call 
option — that is, they wish to buy the option without performing any accompany- 
ing hedging. If ua > up then, presumably, Speculator A would find the Black— 
Scholes option value more attractive than Speculator B. This does not contradict 
the previous theory. A speculator who is willing to accept some risk may value 
an option differently to the Black-Scholes formula. However, if you are selling 
the option and wish to hedge in order to eliminate risk (and if you believe in the 
Black-Scholes assumptions) then (8.19) and (8.24) are the relevant values. 


11.3 Time dependency 


Figure 11.1 shows the Black-Scholes values of a call and a put option, as func- 
tions of asset price S, for certain fixed times t. We used E = 1,r = 0.05, 0 = 0.6 
and took expiry date T = 1. Figure 11.2 shows the same information in three- 
dimensional form. 

In both cases, we see that as t approaches the expiry date T, the option value 
approaches the hockey-stick payoff function. This will always be the case, as we 
showed in Exercise 8.1. In the case of a call option, for each S, the value appears 
to converge to the hockey stick monotonically from above as t approaches expiry. 
This is also generic, since, as we saw in Section 10.3, the time derivative, theta, 
is always negative. On the other hand, for the put option, the convergence is not 
uniformly from above or below. This is consistent with Exercise 10.7, where you 
were asked to show that a put’s time derivative can be negative or positive, see 
Exercise 11.2. 


11.4 The big picture 


Figure 11.3 draws the Black-Scholes European call option value, C (S, t), as a 
surface above the (S, t)-plane, This emphasizes that C (S, t) is a smooth function 
of S and t. Onto the C(S, t)-surface a solid white line adds the corresponding 
C (Si, ti) values mapped out by a discrete asset path. This picture illustrates that 
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— time=0 
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Fig. 11.1. Option value in terms of asset price at five different times. Upper: 
European call. Lower: European put. 


Fig. 11.2. Three-dimensional version of Figure 11.1. 
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0 s 


t 


Fig. 11.3. European call: Black-Scholes surface with asset path superimposed. 


e the Black-Scholes option value surface is smooth, 

e an asset path is jagged, 

e as time varies, an asset path maps out a jagged ‘option path’ over the smooth option 
value surface. 


Figure 11.4 repeats the exercise for a put option. 

In Figure 11.5 we plot the delta surface, dC /0S, for a call option and superim- 
pose three option paths. One option expires in-, one out-of- and one almost at-the- 
money. As discussed in Section 9.3, the rapid gradient of the delta surface induces 
large variations (and hence large swings in the amount of asset in the replicating 
portfolio) when the option is close to being at-the-money. Note from (9.1)—(9.2) 
that, since the vertical axis in the figure has no markings, the corresponding picture 
for a put option would be identical. 


11.5 Change of variables 


On the face of it, the Black-Scholes value of a European call or put option depends 
on the strike price, E, the expiry time, T, the volatility ø and the interest rate, r, 
as well as the asset price S and time t. However, by a judicious re-scaling, we can 
reduce the length of this list to two. 


11.5 Change of variables 
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Fig. 11.4. European put: Black-Scholes surface with asset path superimposed. 
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Fig. 11.5. Black-Scholes surface for delta with three asset paths superimposed. 
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We will introduce three new dimensionless quantities. First is the moneyness 
ratio 


To interpret m, we need to generalize (6.11) into the formula Se““—” for the ex- 
pected value of the asset at expiry, given asset price S at time t, Now we make the 
assumption that the asset growth rate equals the interest rate, u = r. This assump- 
tion will be examined in detail in Chapter 12; for now, we simply note that it leads 
to the following conclusions. 


If m > 0, then the expected asset value at expiry is greater than the strike price. In a ‘risk- 
neutral expectation at expiry’ sense, a call option is in-the-money and a put option 
is out-of-the-money. 

If m = 0, then, in the same sense, call and put options are at-the-money. 

If m < 0, then, in the same sense, a call option is out-of-the-money and a put option is 
in-the-money. 


Second, we have the scaled volatility 


Here, the volatility is combined with the square root of the time to expiry. This 
is natural, since, for example, volatility appears in the form o7(t;;;—1;) in the 
underlying asset model (6.9). The third step is to scale the option values by the 
asset price, by letting 


C= T for a call option, 
and 
; for a put option. 


In these new variables, dı and dù in (8.20) and (8.21) simplify to 


m T m T 
d= =+- and d = =-->, (11.1) 
T 2 T 2 


and, from (8.19) and (8.24), the re-scaled call and put values become 


c(m, T) = N (d1) — e” N (d2) and p(m,T) =e "N(—d2) — N (—d1), 
(11.2) 
see Exercise 11.3. 
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11.6 Notes and references 


Colour versions of Figures 11.3, 11.4 and 11.5 can be downloaded from this book’s 
website, mentioned in the preface. 


11.1. 


11.2. 


11.3. 
11.4. 


11.5. 


EXERCISES 


xx Consider the following ‘explanation’ of why the Black-Scholes Euro- 
pean call option value curve C(S, ft) lies above the payoff hockey stick 
max(S(t) — E,0), fort <T. 


Since E(S(t)) = Soe”, the asset price generically drifts upwards. Hence, on aver- 
age, the asset price will increase between time t and expiry, so the time ¢ value is 
greater than max(S (t) — E, 0). 


Is this argument valid? 
xx Show how Exercise 10.7 provides a counterexample to the following 
statement: 


As t goes from 0 to T, the Black-Scholes European put option value always ap- 
proaches the payoff hockey-stick function from below. 


xx Verify (11.1) and (11.2). 

x x x In the case where the volatility, o, is zero in the asset model (6.9), the 
final asset price is the nonrandom quantity Soe“! . The payoff from a Eu- 
ropean option is then guaranteed to be max(Sge“? — E, 0). It may thus be 
argued that the time-zero option value must be e~"? max(Sge“” — E, 0). 
However, this value clearly depends upon jz, whilst the Black-Scholes for- 
mula does not. (In fact, looking ahead to (14.2), the Black-Scholes value is 
e~'™ max(Soe’! — E, 0).) Can you resolve this apparent contradiction? 
xx Show that ‘Call(—o) = —Put(c)’, that is, replacing o in (8.19) by —o 
is equivalent to evaluating —P (S, t) in (8.24). This relation is sometimes 
called put—call supersymmetry. 


11.7 Program of Chapter 11 and walkthrough 


The program ch11 plots the Black-Scholes surface above the (S, ¢)-plane for a European call, in 
the style of Figure 11.3. It is listed in Figure 11.6. We initialize E,r,sigma and T, and set up the 


array Svals of 50 equally spaced asset prices between 0 and 3 and the array tvals of 50 equally 


spaced time points between 0 and T. The nested for loops then work through Svals and tvals, 


using ch08 to evaluate the Black-Scholes formula. The European call value is stored in the two- 


dimensional array Call. We then use meshgrid to set up two-dimensional arrays Smat and tmat 


that are appropriate for use with the three-dimensional plotting function mesh. 
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%CH11 Program for Chapter 11 
% 
% Draws Black-Scholes surface for European call 


clf 


APA” PPA” Problem parameters %%%%%%%%% 
E=1; r = 0.05; sigma = 0.2; T=1; L =50; 
Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Vo %o Vo Yo Yo Yo Yo Yo Yo Yo 


Svals = linspace(0,3,L); 
tvals = linspace(0,T,L); 
C = zeros(L,L); 
fori = 1:L 
S = Svals(i); 
for j = 1:L 
t = tvals(j); 
[Call,Calldelta,Put,Putdelta] = ch08(S,E,r,sigma,T-t); 
C(i,j) = Call; 
end 
end 


[Smat,tmat] = meshgrid(Svals,tvals); 
mesh(Smat,tmat,C’) 
ylabel(’S’), xlabel(’t’), zlabel(’C(S,t)’) 


Fig. 11.6. Program of Chapter 11: ch11.m. 


PROGRAMMING EXERCISES 


P11.1. Edit chi1.m so that it applies to a European put option, as in Figure 11.4. 


P11.2. Edit ch11.m so that it applies to the delta of a European call option, as 
in Figure 11.5, and investigate the use of surf, surfc and waterfall instead of 
mesh. 


Quotes 


The Black-Scholes formula is still around, 

even though it depends on at least 10 unrealistic assumptions. 

Making the assumptions more realistic 

hasn’t produced a formula that works better across a wide range of circumstances. 
FISCHER BLACK (Black, 1989) 


We know this doesn’t work by rote. 

But this is the best model we have. 

You look at the old-timers who went with their gut. 
You had this model, you had these numbers, 
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and in the end you thought they were a lot more powerful than a guy’s gut. 
ROBERT STAVIS, former member of the Arbitrage group at Salomon Brothers, source 
(Lowenstein, 2001) 


A first-rate theory predicts, 
a second-rate theory forbids 
and a third-rate theory explains after the event. 
ALEXANDER KITAIGORODSKI, 1975, 
source www.byrmeweb.com/sunburn/quotes. html 
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Risk neutrality 


OUTLINE 


e option value as expected payoff 
e risk neutrality 


12.1 Motivation 


In the days before the Black-Scholes formula, it was often argued that a reasonable 
way to value an option is to take the expected payoff. In this chapter we show how 
the expected payoff idea fits in with the Black-Scholes methodology. This leads us 
to the concept of risk neutrality, which will play a fundamental role in Chapters 15, 
16 and beyond, when we discuss computational algorithms. 


12.2 Expected payoff 


To cover European call and put options in a single notation, we let A(x) denote the 
payoff function, so A(x) = max(x — E, 0) for a call and A(x) = max(E — x, 0) 
for a put. The treatment here easily generalizes to other European-style options, 
that is, options whose payoff may be expressed as a function of the asset price at 
expiry. 

Under our model (6.8), the final asset price, S(T), is a random variable of the 
form S(T) = Soe #—2°/DT+0vTZ where Z ~ N(O, 1). So the payoff, A(S(T)), 
is also a known random variable. Why don’t we simply take the time-zero option 
value to be the average payoff, suitably discounted for interest? This gives a value 


e  E(A(S(T))). (12.1) 
Using (3.8) and the density function (6.10), this may be written 
2 
pf AG) (log.x — log So — (u — 30?) ) 
e —— exp | — dx. (12.2 
| xoJV2nJ/T p 202T (12.2) 
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More generally, we could regard the option value at asset price S and time f as the, 
suitably discounted, expectation of the payoff. Letting W (S, t) denote this value, 
we have 


W(S,t) =e" -E (A(S(T)), given asset price S at time t), (12.3) 


which may be written more explicitly as 
WS, t) = e7 PP T _ ^0) 
0 xoV2n/T -t 
2 
(log.x — log S = (u — 402\(T = n) 
d 


EEEF 202T —1) 


x. (12.4) 


The values (12.2) and (12.4) are certainly relevant to an individual who is in the 
habit of writing or holding naked options. However, in comparison with the Black— 
Scholes approach to finding a fair option value, there are a number of related points 
to make. 


(i) Formulas (12.2) and (12.4) were derived without any reference to the idea of hedging 
to eliminate risk. 
(ii) Formulas (12.2) and (12.4) were derived without any reference to the no arbitrage 
principle. 
(iii) Unlike the Black-Scholes PDE (8.15), the formulas (12.2) and (12.4) depend on the 
parameter p. 


Now the Black-Scholes theory tells us that there is only one fair value, and 
this must be the figure quoted in the market. If the market placed the option 
lower/ higher, arbitrageurs would swoop en masse, buying/selling the option, delta 
hedging until expiry, and hence guaranteeing a riskless profit. The forces of supply 
and demand therefore constrain the option to the Black-Scholes level. It follows 
from point (iii) that the expected payoff approach cannot be used to get a fair value. 

On the face of it, expected payoff seems to have no place in option valuation 
theory. However, by a remarkable twist, it is possible to rehabilitate the idea. 


12.3 Risk neutrality 


Figure 12.1 confirms that the time-zero discounted expected payoff (12.2) is indeed 
a function of u. The solid line plots (12.2) as u varies from 0 to 0.1 for a European 
call with Sg = 10, E = 9, r = 0.05, o = 0.2 and T = 3. As we would guess, the 
expected payoff increases with the growth rate, u. Superimposed on the picture as 
a dashed line is the Black-Scholes option value, 2.66. 
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Fig. 12.1. Time-zero discounted expected payoff (12.2) for a European call. 
Black-Scholes value superimposed as a dashed line. 


Keen-eyed observers will note that the solid curve in Figure 12.1 appears to 
pass through the Black-Scholes level at the value u = r = 0.05; that is, when the 
growth rate parameter matches the interest rate. This turns out to be no coinci- 
dence. Exercise 12.1 asks you to verify the general result that 


W (S, t) in (12.4) satisfies the Black-Scholes PDE (8.15) when u =r. 


Now we check the final time and boundary conditions. Taking t = T in (12.3), 
we note that if S(T) is given, and thus nonrandom, then E(A(S(T))) = A(S(T)), 
giving 


W(S,T) = A(S(T)). 


Hence the conditions (8.16) for a call and (8.25) for a put are satisfied. Similarly, 
if S = 0 at any time then we know from (6.9) that S(T) = 0, and hence in (12.3) 


W(0,t) =e "2 AO). 


This matches (8.17) and (8.26) for the call and put, respectively. Finally, we note 
that the arguments given to justify (8.18) and (8.27) are equally valid for (12.3). 
Overall, since W(S,t) with u =r satisfies the same PDE and the same final 
time/ boundary conditions, the uniqueness of the solution tells us that 
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W (S, t) in (12.4) reproduces the Black-Scholes option value when u =r. 


We could re-write this conclusion as follows. 


No matter what parameters jz and o in the asset model (6.9) we believe to be correct, we 
can obtain the Black-Scholes option value by pretending that the drift, u, is equal to the 
interest rate, r, and taking the discounted expected payoff. 


In setting u = r we are making what is known as a risk neutrality assumption. 

We will see in Chapters 15 and 16 that the risk-neutral expectation framework 
allows us to develop computational methods for approximating options where an- 
alytical formulas are not available. 


12.4 Notes and references 


It is perfectly standard, but not particularly enlightening, to give the name risk 
neutrality to the condition u = r. The phrase borrows from the concept of a risk- 
neutral investor; an unlikely person who regards 


e an investment with guaranteed rate of return r, and 
e arisky investment with expected rate of return r 


as equally attractive. In the case where all assets satisfy the lognormal model (6.9) 
with the same growth parameter u — the so-called risk-neutral world — we see from 
(6.11) that a risk-neutral investor would have no preferences between investing in 
a bank and in any asset. 

In the risk-neutral world, (6.11) shows that E(S(t)) = Soe”’, so the expected 
discounted asset price is E(e~’’S(t)) = So. In other words, the expected dis- 
counted asset price does not change with time; it remains at its time-zero level. A 


process like this, whose expected future value is given by its current value, is called 
a martingale. By using martingale theory it is possible to convert the simple obser- 
vation in Exercise 12.1 into a rigorous and powerful theory for option valuation. 
In particular, this is an alternative way to derive the Black-Scholes formulas. The 
texts (Duffie, 2001; Karatzas and Shreve, 1998; Nielsen, 1999) cover this material 
in depth, while perhaps the most accessible introduction is (Baxter and Rennie, 
1996). Chapter 6 of (Kritzman, 2000) also gives a very readable, example-driven 
coverage of risk neutrality. 

In Chapter 16 we introduce the binomial method as a computational technique 
for option valuation. It is also possible to use the binomial framework as an 
analytical tool with which the Black-Scholes formulas can be derived without 
recourse to PDEs. The concept of risk neutrality arises quite naturally in this 
setting. Exercise 12.5 provides a cut-down version of the idea. The text (Baxter 
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and Rennie, 1996) and the on-line lecture notes of Professor Robert Kohn at 
www.math.nyu.edu/faculty/kohn/ are good places to learn more. 


12.1. 


12.2. 


12.3. 


12.4. 


12.5. 


EXERCISES 


x x x Using a large sheet of paper and a pen with plenty of ink, show that 
for u =r the quantity W (S, t) in (12.4) satisfies the Black-Scholes PDE 
(8.15). (You may differentiate inside the integral sign without worrying 
about whether this is justified.) 

xxx» Consider a European-style option with payoff at expiry given by 
A(S(T)) = S(T). Explain why the time-zero value of this option must be 
So. By using (6.11), show that asking for the discounted expected payoff 
(12.1) to match this value leads immediately to the risk neutrality condition 
u =r. 

xx Given initial asset price So at time t = 0, show that, in a risk-neutral 
world, the factor N (d2) in the Black-Scholes formula (8.19) represents the 
probability that a European call option will be exercised. 

x x x Show that the value W (S, t) in (12.4) can be computed from the fol- 
lowing recipe. 


(i) Compute the Black-Scholes option value at (S, t) with the interest rate set 
tor =u. 
(ii) Scale this quantity by e#—7—9, 


(This recipe was used to create Figure 12.1.) 
x x x Consider the following, simplified scenario for valuing a European- 
style option. 


e The time-zero asset price is So. 
e Atexpiry, the asset price may take only two possible values 


S(T) = Sup > So, with probability p, 
S(T) = Sdown < So, with probability 1 — p. 


Let A denote the payoff function, and let Aup := A (Sup) and Adown := 
A(Sdown) denote the two possible payoffs at expiry. Take a portfolio at time 
t = 0 consisting of A units of asset and an amount C of cash. Asking for 
this portfolio to replicate the option (i.e. to have payoff Ayp when S(T) = 
Sup and Adown when S(T) = Sdown) leads to a pair of linear equations for 
A and C. Find and solve these to obtain 


Aup — Adown 


A= (12.5) 


Sup — Sdown 
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ay Aup — Aa 
C=e x [Aona = (Eet) Saon | : (12.6) 
up — down 


Then use the no arbitrage principle to deduce that a fair time-zero value for 
the option is 


s (= = ee) + eT (Ce = none (12.7) 
0 ea a . . 
Sup — Sdown Sup — Sdown 


Now, let 


Soe"! — Sdown 

Gee ne 
Sup — Sdown 

Use the no arbitrage principle to argue that 0 < g < 1 must hold. Show 

that the value in (12.7) may also be interpreted as the discounted expected 

payoff of an asset taking the values 


S(T) = Sup > So, with probability q, 
S(T) = Sdown < So, with probability 1 — q. 


Can you see any features from this simplified scenario that carry through 
to the Black-Scholes version? 

12.6. xx x In Section 10.3 we gave a financial interpretation of the inequality p > 
0. Use the risk neutrality viewpoint to give an alternative interpretation. 


12.5 Program of Chapter 12 and walkthrough 


The program ch12, listed in Figure 12.2, illustrates risk neutrality in the manner of Figure 12.1. We 
fix S,E,r,sigma and T and an array of 200 values for mu. A for loop is then used to compute an 
array epayoff which stores the discounted time-zero Black-Scholes value when r is set to each mu 
value; see Exercise 12.4. This is done via the ch08 function from Chapter 8. After executing this 
loop, we use ch08 to obtain the true Black-Scholes value, C. We then plot the (muvals,epayoff) 
curve and superimpose a dashed line at height C. 


PROGRAMMING EXERCISES 


P12.1. Confirm experimentally the result mentioned in Exercise 12.3. Do this by 
generating a large number of expiry-time asset prices, and counting the proportion 
that are in-the-money. 


P12.2. Investigate the use of quad and quad1 for evaluating integrals of the form 
(12.4). 
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%CH12 Program for Chapter 12 

% 

% Compute expected payoff for European call 
% Illustrates risk neutrality 


clf 


AAPP% Problem parameters %%%%%% 
S=5; E=7; r=0.08; sigma = 0.3; T= 1; 
M = 200; muvals = linspace(0,0.16,M); 

Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo oP P P P P P A 


epayoff = zeros(M,1); 

fork =1:M 
mu = muvals(k); 
% work out time-zero Black-Scholes value with r = mu 
[C, Cdelta, P, Pdelta] = ch08(S,E,mu,sigma,T); 
epayoff(k) = exp((mu-r)*T)*C; 

end 


% true Black-Scholes value 

[C, Cdelta, P, Pdelta] = ch08(S,E,r,sigma,T); 
plot(muvals,epayoff,’r-’); 

hold on, grid on 
plot([muvals(1),muvals(end)],[C,C],’b-’); 

xlabel(’\mu’), legend(’Expected payoff’,’Black-Scholes’) 


Fig. 12.2. Program of Chapter 12: ch12.m. 


Quotes 


...risk-neutrality is far from easy to grasp intuitively, 
which is perhaps the source of the confusion above. 
The key steps in the derivation of the Black-Scholes equation, 
namely no arbitrage and that risk-free portfolios can earn the risk-free rate, 
are intuitively clear. 
PAUL WILMOTT, SAM HOWISON AND JEFF DEWYNNE (Wilmott et al., 1995) 


Risk neutral valuation, which was developed by John Cox and Stephen Ross, 
has the dual virtues that it can be applied to practically any option valuation problem 
and it is marvelously intuitive. 
MARK P. KRITZMAN (Kritzman, 2000) 


To put it simply, 
if there is an arbitrage price, any other price is too dangerous to quote. 
MARTIN BAXTER AND ANDREW RENNIE (Baxter and Rennie, 1996) 
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Solving a nonlinear equation 


OUTLINE 


e general problem 
e bisection method 
e Newton’s method 


13.1 Motivation 


In the next chapter, where we look at computing the implied volatility, we will 
need an algorithm for solving a nonlinear equation. This chapter introduces two 
such algorithms. 


13.2 General problem 


The task that we consider in this chapter is 


given a function F : R > R, find an x* € R such that F (x*) = 0. 


In general, of course, we cannot find an x* analytically, and must therefore con- 
tent ourselves with an approximation via a computational method. It is also worth 
keeping in mind that, depending on the nature of F, there may be no suitable x*, 
exactly one x* or many x* values. 


13.3 Bisection 
The bisection method is based on the observation that if a continuous function 


changes sign then it must pass through zero; that is, 


for continuous F, if xg < xp with F (xq) F (xp) < 0, 
then F (x*) = 0 for some Xa < x* < Xp. 


Having found xg and xp with F (xg) F (xp) < 0, we could evaluate F at the mid- 
point Xmid := (Xa + Xp)/2. The sign of F (xmia) must then match either F (xa) or 
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F (xp). This means that one of the intervals [xa, Xmia] or [Xmia, xp] must contain 
an x*. By repeating this process, we can construct an arbitrarily small interval in 
which an x* must lie — hence we can find an x* to any level of accuracy. 

We may thus spell out the bisection method as follows. 


Step 1: Find xg and xp with xg < xp such that F (xq) F (xp) < 0. 

Step 2: Set xmia := (Xa + xp)/2 and evaluate F (xmia). 

Step 3: If F (xq) F (Xmia) < 0 then reset xp = Xmid. 
Otherwise reset xg = Xmid- 

Step 4: If x, — xq < € then stop. Use 5 (Xa + xp) as the approximation to x”. 
Otherwise return to Step 2. 


Note that we must choose a value € > 0 for our stopping criterion xp — Xa < €. 
It is easy to see that the value (xg + xp)/2 on termination is no more than a distance 
é/2 from a solution x*. Hence, ¢ controls the accuracy of the process. 

There is no foolproof procedure for finding suitable x4 and x, in Step 1. Without 
specific knowledge of the function F we must resort to trial and error. 

Because the bisection method halves the length of the interval [xa, xp] on each 
iteration, we may bound the error at the kth iteration by L/2%*!, where L is the 
length of the original interval, x, — xa. This is referred to as a linear convergence 
bound because the error bound decreases by a linear factor, in this case 7 on each 
iteration. We consider next a faster method. 


13.4 Newton 


Newton’s method (also called the Newton—Raphson method) can be derived in 
a number of ways. We will use a Taylor series approach. Suppose we wish to 
compute a sequence xo, x1, x2, ... that converges to a solution x*. We may expand 
F (xn + ô) for small 6 by 


F (xn + 8) = F (xn) + 8F' (Xn) + O (8°). (13.1) 


Ignoring the O(6*) term and setting F(xn)+8F'(xn)=0 gives ê= 
—F (xn)/F' (xn). It follows that if x, is close to a solution x* then 
Pe 

F' (xn) 


Xn+1 = Xn (13.2) 
should be even closer. Given a starting value, xo, the iteration (13.2) defines New- 
ton’s method. 

Since we discarded an O (8°) term in (13.1), we may expect that the error 
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Xn — x* squares as n increases to n + 1; that is, if x, — x* = O(6) then xn41 — 
x* = O (8°). To see this more clearly, note that, using F(x*) = 0 and assuming 
F'(xn) Æ 0 in (13.2), a Taylor series gives 


F (xn) — F (x*) 
F' (xn) ) 
(Xn — X*)F' Xn) + O (xn — x*)*) 
F'(Xn) 
-0 (r 2 S) l (13.3) 


TEETE 


= Xn — x* — 


This type of analysis can be formalized to give the following result. 


Theorem 1 Suppose F has a continuous second derivative, and suppose x* € 
R satisfies F(x*) = 0 and F’(x*) Æ 0. Then there exists a ô > O such that for 
|xo — x*| < 6 the sequence given by (13.2) is well defined for all n > 0, 


lim |x, —x*|=0 
n—> oœ 
and there exists a constant C such that 


eee x*| < Cle — x*|?. (13.4) 


The bound (13.4) shows that Newton’s method has quadratic or second or- 
der convergence. However, the result requires the starting value xo to be chosen 
sufficiently close to x*. In practice Newton’s method works very well when a 
suitable xo is found, but may fail to converge otherwise. 


Computational example Suppose we wish to find the value of x* such that 
P(X < x*)= Z, where X ~ N(0, 1). Equivalently, we want to solve F (x) = 0, 
where F(x) := N (x) — Z with N (x) defined in (3.18). It follows from the defi- 
nition of N (x) that F (x) is an increasing function of x with F (0) = 5 — 3 <0 
and limy +o F(x) =1— 3 > 0. Hence, we may immediately conclude that 
F(x) = 0 has a unique solution 0 < x* < oo. This can be confirmed from the 
plot of F(x) in Figure 13.1. We may apply the bisection method with xy = 0 
and with x, sufficiently large that F (xp) > 0. For the choice x, = 10 and a tol- 
erance of £ = 1075 in the stopping criterion, the errors |xmiq — x*| are shown as 
asterisks in the left-hand plot of Figure 13.2. Note that the y-axis is logarithmi- 
cally scaled. We see that 20 iterations were taken in the bisection method. The 
ie has been added to the plot. The pre- 
ceding analysis shows that the error lies below this line. The right-hand plot in 


dashed line corresponding to 10 x ( 5 


Figure 13.2 shows the corresponding errors for Newton’s method. Here we set 
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Fig. 13.1. The function F(x) := N(x) — Z, 
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Fig. 13.2. Error in the bisection method (left) and Newton’s method (right). A 
reference line of slope —1 has been added in the left-hand plot. 
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xo = 1 and stopped when |xy41 — Xn| < 1075. We see that only 4 iterations were 
required to produce an error of around 107 !2 and the error roughly squares from 
one step to the next. Repeating Newton’s method with xo = 2, however, resulted 
in a sequence that ‘blew up’ — the numbers became too large for the computer to 
store. © 


13.5 Further practical issues 


There are many issues that we have not addressed here. It is possible, for exam- 
ple, to design a hybrid algorithm that uses a safe method, like bisection, until 
the iterates are close to an x* and then switches to Newton’s method to get the 
benefit of rapid convergence. Also, the residual |F (x,)| gives a measure of how 
close x, is to a solution, and this can be incorporated into the stopping criterion. 
Furthermore, although we have considered only a single nonlinear equation, it is 
possible to generalize Newton’s method to the case of many equations in many 
unknowns. 


13.6 Notes and references 


Most introductory numerical analysis texts have a chapter on solving nonlinear 
equations. An excellent and up-to-date specialist treatment that includes MAT- 
LAB codes is (Kelley, 1995). The classic advanced text is (Ortega and Rheinboldt, 
1970). 

If you need to brush up on Taylor series, order notation and, for the next chap- 
ter, the Mean Value Theorem, there are many introductory texts to choose from; 
(Estep, 2002) is an excellent modern treatment. 


EXERCISES 


13.1. x Suppose that Step 1 of the bisection method has been completed for a 
continuous function F and let L = xp — xq. In terms of L and £, how many 
iterations of Steps 2—4 will be taken? Check that your answer is consistent 
with the left-hand plot in Figure 13.2. 

13.2. xx Consider the following approach to computing a sequence of approxima- 
tions x9, x1, %2,... to x*. Given x, let x41 be the solution to p(x) = 0, 
where p(x) is an approximation to F(x) determined by the three con- 
ditions (a) p(x) is linear, (b) pn (Œn) = F (Xn) and (c) pp Œ) = F'n). 
Draw a picture to illustrate this construction and then show that xn+1 
is given by (13.2). (Hence, this is an alternative derivation of Newton’s 
method.) 
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xx To compute the errors that are shown in Figure 13.2 it was ne- 
cessary to obtain the exact solution x*. This was done by setting 
xstar = sqrt(2)*erfinv(1/3) where erfinv is MATLAB’s built-in 
routine to evaluate the inverse error function described in Exercise 4.3. 
Confirm that xstar is the required solution. 

xx Look at Figure 13.1. Using a ruler and pencil, and following the lin- 
earization approach in Exercise 13.2, convince yourself that Newton’s 
method will converge with the starting value x9 = 1, but will not converge 
with the starting value x9 = 2. 


13.7 Program of Chapter 13 and walkthrough 


In ch13, listed in Figure 13.3, we apply Newton’s method to N (x) + e* = 2. The line 
exact = fzero(inline(‘0.5*(1terf(x/sqrt(2))) + exp(x)- 2’),1); 


%CH13 


% 


Program for Chapter 13 


% Apply Netwon’s method to N(x) + exp(x) = 2. 


exact = fzero(inline(’0.5*(1+erf(x/sqrt(2))) + exp(x)- 2’),1); 


x0=1; 

x = x0; 
xdiff = 1; 
k=1; 
kmax = 100; 
tol = 1e-8; 


while (xdiff >= tol & k < kmax) 
Fval = 0.5*(1+erf(x/sqrt(2))) + exp(x) - 2; 
Fprime = exp(-0.5*x°2)/sqrt(2* pi) + exp(x); 
increment = Fval/Fprime; 
x = X - increment; 
xnewton(k) = x; 
newterr(k) = abs(xnewton(k)-exact); 
k = k+1; 
xdiff = abs(increment); 
end 


format short e % non-default for number display 
disp(’Newton error’) 

disp(newterr’) 

format % reset to default for number display 


Fig. 13.3. Program of Chapter 13: ch13.m. 
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uses MATLAB’s built-in equation solver {zero to compute an ‘exact’ solution, which we 
use for reference. The syntax 


while (xdiff >= tol & k < kmax) 


end 


sets up a loop that repeats while both xdiff >= tol andk < kmax remain true. In other 
words, the loop terminates when either xdiff drops below tol or the maximum number, 
kmax, of iterations has been reached. Inside the loop we implement Newton’s method for 
the problem. The error in each iterate is stored in the array newterr. 

On exiting the loop, we output the errors. The line format short e sets up a number 
display format that is appropriate for this output. At the end of the program we reset the 
display to the default with format. 

Output from ch13 is 


Newton error 
1.5465e-01 
8.3622e-03 
2.4964e-05 
2.2279e-10 
1.1102e-16 


This is consistent with the quadratic convergence discussed in Section 13.4 — the error 
roughly squares from one iteration to the next until it reaches a level that the machine 
cannot distinguish from zero. 


PROGRAMMING EXERCISES 


P13.1. Investigate the convergence of the bisection method on the problem solved 
by ch13. 


P13.2. Using your answer to programming exercise P12.2, apply bisection to con- 
firm that the two curves displayed in Figure 12.1 intersect at u =r. 


Quotes 


Chance has put in our way a most singular and whimsical problem, 
and its solution is its own reward. 
SHERLOCK HOLMES, in The Adventure of the Blue Carbuncle by Sir Arthur Conan 
Doyle 


A blunder is an accidental mistake, 
as opposed to an approximation error, which is merely a compromise. 
ROBERT M. CORLESS (Corless, 2002) 
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Implied volatility 


OUTLINE 


e the need for implied volatility 

e properties of option value as a function of o 

e bisection and Newton for computing the implied volatility 
e volatility smiles and frowns 


14.1 Motivation 


We now put the bisection method and Newton’s method to work on the problem 
of computing the implied volatility. 


14.2 Implied volatility 


The Black-Scholes call and put values depend on S, E,r, T — t and o. Of these 
five quantities, only the asset volatility o cannot be observed directly. How do 
we find a suitable value for ø? One approach is to extract the volatility from the 
observed market data — given a quoted option value, and knowing S, t, E,r andT, 
find the o that leads to this value. Having found o, we may use the Black-Scholes 
formula to value other options on the same asset. A o computed this way is known 
as an implied volatility. The name indicates that o is implied by option value data 
in the market. 

A completely different way to get hold of o is described in Chapter 20. 

We focus here on the case of extracting o from a European call option quote. An 
analogous treatment can be given for a put, or, alternatively, the put quote could be 
converted into a call quote via put—call parity (8.23). 


14.3 Option value as a function of volatility 


We assume that the parameters E, r and T and the asset price S and time ¢ are 
known. (In practice, we will typically be interested in the time-zero case, t = 0 
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and S = So.) We thus treat the option value as a function of ø only, and, for the 
rest of this chapter, denote it by C (o). Given a quoted value C*, our task is to find 
the implied volatility o* that solves C (o) = C*. 

Computing the implied volatility requires the solution of a nonlinear equa- 
tion and hence, from Chapter 13, we may use the bisection method or Newton’s 
method. We will find that it is possible to exploit the special form of the nonlinear 
equation arising in this context. 

Since volatility is non-negative, only values ø € [0, oo) are of interest. Let us 
look at C (o) in the case of large or small volatility. First, as o — oo, we see from 
(8.20) that dj — co and hence N (d1) —> 1. Similarly, from (8.21), as o —> œ, 
dy — —œ and hence N (d2) — 0. It follows in (8.19) that 


lim C(o) =S. (14.1) 
T-7009 


Next, we look at the limit ø —> O* and separate out three cases. 


Case 1: S— Ee~"7-9 > 0. In this case log(S/E)+r(T —t) >0, so as o > OF 
we have dj > œ, N(d\) > 1, da > œ and N(d2) —> 1. Hence, C > S$ — 
Ee T-t), 

Case 2: S — Ee"? <0. In this case log(S/E) +r(T — t) <0, so as o > 0t we 
have di > —oo, N (dı) —> 0, dy —> —œ and N (d2) — 0. Hence, C > 0. 

Case 3: S — Ee~"7-9 = 0. In this case log(S/E)+r(T —t)=0, so as o > OF 
we have dı > 0, N (d1) > T dy — 0 and N (d2) > T Hence, C > 5(S — 
Ee T-0) = 0. 


The three cases are summarized neatly by the formula 


lim C (0) = max(S — Ee™T-) 0). (14.2) 
o—>0 


Now we recall from Chapter 10 that the derivative of C with respect to o, that is, 
the vega, is given by (10.6). In particular, we know that dC /do > 0. Since C (o) 
is continuous with a positive first derivative, we conclude that C is monotonic 
increasing on [0, oo). From (14.1) and (14.2), values of C (o) must lie between 
max(0, S — Ee~""—) and S. It follows that C(o) = C* has a solution if and 
only if 


max(S — Ee "7-9 0) < C* < S, (14.3) 


and if a solution exists it is unique. Henceforth, we assume that this condition 
holds. For further justification of this assumption we note from Section 2.6 that if 
(14.3) is violated then an arbitrage opportunity exists. 
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For later use, we will calculate the second derivative. Differentiating (10.6) 
gives 
°C S/T =t - ta? g Odi 
=— e —. 
3o? 2x Jo 


From (8.20) we have 
dd, — log(S/E)+r(T —t) ı 


ao sT t 5 T -t 
_ Plog(S/E) + (r — (6?/2))T — 1) 
Seer eg 

d2 
oe 


and hence 


aC SVT -t sgedidy _ didz 3C 
— e = $ 
3o? 2T oO o ðo 


It follows from (14.4) that ƏC /do is maximum over [0, oo) ato = F, where 


(14.4) 


log S/E +r(T —t 
a= j SEE ETES (14.5) 
T -t 
see Exercise 14.1. Moreover, 0*C /3o? may be written in the form 
3C Tat 4ôC 
— = — —, 14.6 
do? 403 Cree Ja re) 


see Exercise 14.2. The identity (14.6) shows us that C (o) is convex for o <@ 
and concave for o > G. This will allow us to get a globally convergent Newton 
iteration by suitably choosing the starting value. 


14.4 Bisection and Newton 


We will write our nonlinear equation for o* in the form F (o) = 0, where F (o) := 
C(o) — C*. To apply the bisection method, we require an interval [o,, op] over 
which F (o) changes sign. It follows from (14.1), (14.2) and the monotonicity of 
C (o) that this can be done by fixing K (say K = 0.05) and trying [0, K], [K, 2K], 
[2K ,3K],.... 

Newton’s method takes the form 


F (Gn) 
F'n)’ 


(14.7) 


On+1 = On 
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where F'(o) = 0C /0q is given by (10.6). Because we know a lot about F , we can 
exploit an expansion along the lines of (13.3) that keeps track of the remainder. 
Using F (o*) = 0 and the Mean Value Theorem, we have 


* * F (on) — F(o*) 
On+1 Z0 = On, -0 — —— 


F' (on) 
= On — o* = (On = gE: 
F' (on) 
for some &, between o, and o*. Hence, we may write 
— 7* F' 
On+1 — 0 ZNE (En) (14.8) 
On — O* F' (on) 


We know that F’(o) is positive and takes its maximum at the point in (14.5). 
Hence, using the starting value oo = @ we must have 0 < F’(&) < F'(©) in 
(14.8), so that 


oj — o* 


0< 


eed (14.9) 
09 — 0o 

This means that the error in o1 is smaller than, but has the same sign as, the error 

in oo. To proceed we suppose that © < o*. Then (14.9) tells us that og < o < 

o*. Now, we know from (14.6) that F”(o) < 0 for all o > G and we also know 

that € in (14.8) lies between g and o*. Hence 0 < F’(&,) < F’(o,) and (14.8) 

gives 


o — o* 
0< <1 
o] — o* 
Continuing this argument gives 
= * 
ga TETE et. “orale =O, (14.10) 


On — o* 
So the error decreases monotonically as n increases. 

In a similar manner, it can be shown that (14.10) holds in the case where > 
o*, see Exercise 14.3. Overall, we conclude that with the choice oo = @ the error 
will always decrease monotonically as n increases. It follows that the error must 
tend to zero, and the theory from Chapter 13 then shows that convergence must 
be quadratic. Hence, oo = G is a foolproof starting value for Newton’s method 
on this particular nonlinear equation. This is therefore our method of choice for 
computing the implied volatility. 


Computational example Figure 14.1 illustrates the performance of Newton’s 
method in the case where So = 3, E = 1,r = 0.05, T = 3 and t = 0. We used 
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Fig. 14.1. Newton’s method for the implied volatility. Upper picture: iterates. 
Lower picture: errors. 


o* = 0.15 in order to compute the Black-Scholes value for C, and then applied 
Newton’s method to see how quickly o* could be found. We took the starting 
value oo = © given by (14.5), so monotonic convergence is guaranteed. The up- 
per picture in Figure 14.1 shows the curve C (o) and superimposes the start- 
ing value (o0, C (o9)) and the subsequent iterates (on, C (o,)). The lower picture 
plots the size of the error, |o, — o*|. We see that the initial convergence is quite 
slow, but ultimately the characteristic second order behaviour emerges. The slow 
initial decrease in the error may be caused by the fact that F’(o*) is close to zero. 
(Recall that F’(o*) Æ 0 is an assumption in the convergence theorem. If F’(a*) 
were exactly zero then Newton’s method would converge at a rate slower than 
quadratic — see (Ortega and Rheinboldt, 1970), for example). © 


14.5 Implied volatility with real data 


We now look at the implied volatility for call options traded on the London In- 
ternational Financial Futures and Options Exchange (LIFFE), as reported in the 
Financial Times on Wednesday, 22 August 2001. The data is for the FTSE 100 
index, which is an average of 100 equity shares quoted on the London Stock Ex- 
change. The expiry date for these options was December 2001. 
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Exercise price Option price 
5125 475 
5225 405 
5325 340 
5425 2805 
5525 226 
5625 1794 
5725 139 
5825 105 


FTSE 100, 22 August, 2001 


0.192 


0.188 F 


0.186 f 


0.184 f 


0.182 - 


0.18 F 


Implied volatility 


0.178 F 


0.176 F 


0.174 b Current asset price 


172 
5100 5200 5300 5400 5500 5600 5700 5800 5900 
Exercise price 


0. 


Fig. 14.2. Implied volatility against exercise price for some FTSE 100 index 
data. 


The initial asset price (on 22 August 2001) was 5420.3. We took values of r = 
0.05 for the interest rate and T = 4/12 for the duration of the option. Figure 14.2 
shows the implied volatility computed for the eight different exercise prices. Of 
course, if the Black-Scholes formula were valid, the volatility would be the same 
for each exercise price. We see that in this example the implied volatility varies 
by around 10%. We also note that the implied volatility is higher for options that 
start in-the-money than for options starting out-of-the-money. This behaviour is 
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typical for data arising after the stock market crash of October 1987. Pre-crash 
plots of implied volatility against exercise price would often produce a convex 
smile shape; more recent data tends to produce more of a frown. 


14.6 Notes and references 


The convergence analysis for Newton’s method is based on the article (Manaster 
and Koehler, 1982). It is also mentioned in (Kwok, 1998). More about implied 
volatility can be found in (Hull, 2000; Kwok, 1998), for example. 

The widely reported phenomenon that the implied volatility is not constant as 
other parameters are varied does, of course, imply that the Black-Scholes formu- 
las fail to describe perfectly the option values that arise in the marketplace. This 
should be no surprise, given that the theory is based on a number of simplifying as- 
sumptions. Despite the disparities, the Black—Scholes theory, and the insights that 
it provides, continue to be regarded highly by both academics and market traders. 
Indeed, it is common for option values to be quoted in terms of ‘vol’; rather than 
giving C*, the o* such that C(o*) = C* in the Black-Scholes formula is used to 
describe the value. 

Many attempts have been made to ‘fix’ the nonconstant volatility discrepancy 
in the Black-Scholes theory. A few of these have met with some success, but 
none lead to the simple formulas and clean interpretations of the original work. 
Chapter 17 of (Hull, 2000) gives a good overview of the directions that have been 
taken. 


EXERCISES 


14.1. xx» Show that dC/do has a unique maximum over [0, œ) at o = F, 
where © is defined in (14.5). 

14.2. xx Verify the identity (14.6). 

14.3. xxx Suppose op = ©. Using the fact that F”(o) > 0 for o < C, confirm 
that (14.10) holds in the case where 6 > o*. 


14.7 Program of Chapter 14 and walkthrough 


In ch14, listed in Figure 14.3, we implement Newton’s method for implied volatility of a European 
call. After setting up r,S,E,T and tau, we use ch08 from Chapter 8 to compute the call value, 
C_true, corresponding to a volatility of sigma_true=0.3. Our task is then to recover the volatility 
that produces the call value C_true. We use a while loop of the form discussed for ch13, with a 
call to ch10 providing the required vega value. The final solution is correct to within 6 x 10-17, 
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%CH14 Program for Chapter 14 
% 
% Computes implied volatility for a European call 


APA” PPAP” parameters PHVLLWVV%% 
r = 0.03; S = 2; E = 2; T = 3; tau = T; sigma true = 0.3; 
[C_true, Cdelta, P, Pdelta] = chO8(S,E,r,sigma_true,tau); 
Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo oP PAPA 


%starting value 
sigmahat = sqrt(2*abs( (log(S/E) + r*T)/T ) ); 


PAPPP% Newton's method W%%%% 
tol = 1e-8; 
sigma = sigmahat; 
sigmadiff = 1; 
k=1; 
kmax = 100; 
while (sigmadiff >= tol & k < kmax) 
[C, Cdelta, Cvega, P, Pdelta, Pvega] = ch10(S,E,r,sigma,tau); 
increment = (C-C_true)/Cvega; 
sigma = sigma - increment; 
k = k+1; 
sigmadiff = abs(increment); 
end 
sigma 


Fig. 14.3. Program of Chapter 14: ch14.m. 


PROGRAMMING EXERCISES 


P14.1. Alter ch14 to deal with a put option. 


P14.2. Acquire some real option data, either electronically or via a newspaper, 
and create a figure like Figure 14.2. If possible, investigate the behaviour of the 
implied volatility as the expiry time varies. 


Quotes 


The volatility is the most important and elusive quantity 
in the theory of derivatives. 
PAUL WILMOTT (Wilmott, 1998) 


A smiley implied volatility is the wrong number 
to put in the wrong formula 
to obtain the right price. 
RICCARDO REBONATO (Rebonato, 1999) 


It is the strong opinion of the author that most traders 
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will gain an improved performance by concentrating their efforts 
on a better prediction of the volatility input into a Black-Scholes type model 
rather than introducing other pricing techniques. 
A. L. H. SMITH (Smith, 1986) 


In those days, before the publication of the Black-Scholes option-pricing formula, 
warrants were often grossly mispriced. Thorpe soon developed a computer program 
to identify such opportunities; its deployment was so successful that, 
by 1970, both Thorpe and Kassouf had abandoned academe for greener pastures. 
JAMES CASE, reviewing the book (Bass, 1999) in Society for Industrial and Applied 
Mathematics (SIAM) News, Jan/Feb, 2001. 
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Monte Carlo method 


OUTLINE 


e Monte Carlo 

e confidence intervals 

e Monte Carlo for option valuation 
e Monte Carlo for Greeks 


15.1 Motivation 


Chapter 12 showed that valuing an option can be regarded as computing an ex- 
pected value. The idea of using pseudo-random number generators to compute 
estimates of expected values was touched on in Chapter 4. Here we pull these two 
threads together and introduce the Monte Carlo approach to valuing an option. 
As we will see in Chapter 19, this provides a powerful means to compute option 
values in cases where no analytical formulas are available. 


15.2 Monte Carlo 


To begin, we consider the case of a general random variable X, whose expected 


value E(X) = a and variance var(X ) = b? are not known. Suppose 


e we are interested in computing an approximation to a (and possibly b), and 
e we are able to take independent samples of X using a pseudo-random number generator. 


We know from Table 4.2 that computing the average of a large number of samples 
can give a good approximation to the mean. Hence, if we let X1, X2,..., Xm 
denote independent random variables with the same distribution as X then we 
might expect 


1 M 
a Xi 15.1 
am MÈ (15.1) 
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to be a good approximation to a. We say that an approximation to E(X) is un- 
biased if it has the same expected value as X. It is easily shown that ay in 
(15.1) is unbiased; see Exercise 15.1. To estimate the variance, since var(X) := 
E((X — E(X))?), an obvious choice is Oy (Xj - am)*)/M. However, to make 
this estimate unbiased we need to re-scale it slightly. Exercise 15.2 asks you to 
check that the appropriate unbiased version is 


1 M 
bu = —— (Ki - am)’. (15.2) 
M-1 i=l 


By the Central Limit Theorem, ma X; behaves like an N(Ma, Mb?) random 
variable, so 


2 
am —a is approximately N (0 z) : (15.3) 
We could also say that am — a is approximately an N(O, 1) random variable scaled 
by b//M. This suggests that sampling ay for large M should give an approxima- 
tion to a that is correct to O(1/ JM). 
We can make this argument more quantitative by using the idea of a confidence 
interval that was introduced in Section 6.5. If we had equality in (15.3) then, from 


(6.15), 
1.96b 1.96b 
Pla- <ay <a + —— |= 0.95. 
( JM =) 


We may re-write this as 


1.96b 1.96 b 

P {am — <a <an + =R) = 0.95. (15.4) 
( /M vM 

The ratio b/v M appearing in (15.4) is often refered to as the standard error. Re- 

placing the unknown b by the approximation bm we see that the unknown expected 

value a lies in the interval 


L9%6bu a 
am — a 
MOO JM Y JM 


with probability 0.95, approximately. In other words (15.5) gives an approximate 
95% confidence interval for a. 

This analysis leads us to the basic Monte Carlo method for approximating a. We 
compute M independent samples and form ay in (15.1). In order to monitor the 
error, we also compute the variance approximation bi, in (15.2). Having by allows 
us to compute the confidence interval (15.5) (or indeed, a confidence interval for 
some other percentage, such as 99%; see Exercise 6.8). 


(15.5) 
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Fig. 15.1. Monte Carlo approximations to E(e~), where Z ~ N (0, 1). Crosses 
are the approximations, vertical lines give computed 95% confidence intervals. 
Horizontal dashed line is at height E(e“) = y/e. 


There are two key features to note. 


(i) The size of the confidence interval shrinks like the inverse square root of the number 
of samples. To reduce the ‘error’ by a factor of 10 requires a hundredfold increase in 
the sample size. This is a severe limitation that typically makes it impossible to get 
very high accuracy from a Monte Carlo approximation. 

(ii) The size of the confidence interval is directly proportional to the standard deviation, 
that is the square root of the variance, of the random variable under consideration. In 
practice, it is highly desirable to transform the problem of approximating E(X) to the 
problem of approximating E(Y) where Y is another random variable that has the same 


mean as X but a smaller variance. This idea, known as variance reduction, forms a 
vital part of practical Monte Carlo algorithms. The two most popular approaches are 
covered in Chapters 21 and 22. 


Computational example In Figure 15.1 we give results from a Monte Carlo 
simulation of E(e“), where Z ~ N(0, 1). In this case we can work out analyt- 
ically that E(e~) = \/e; see Exercise 15.3. We used 13 different sample sizes, 
M =2°,2°,27,...,2!7. For each sample size, the picture plots the computed 


mean, ay, with a cross and gives the computed 95% confidence interval as a 
vertical line, often called an error bar. Note that both axes have logarithmic 
scales. The exact mean, ye, is represented as a dashed line. We see that as M in- 
creases the computed mean generally becomes more accurate and the confidence 
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interval shrinks. In the third case, M = 2’, the correct mean is not contained in 
the confidence interval. Remember that our theory predicts that this will happen 
roughly 5% of the time, but requires M sufficiently large that 


e the Central Limit Theorem approximation is accurate, and 
e the computed variance by approximates well the exact variance b. 


A separate check revealed that with M = 2’ the variance error bt, — b?| was a 
non-negligible 7.1. The errors for M = 2! and M = 2" are 5.31 x 107° and 
3.64 x 1073, respectively. The ratio of these errors is ~ 1.5, which is close to 
the asymptotic (M —> co) value of 2. This computation is typical — we have 
achieved a few digits of accuracy with a modest amount of work. The ‘curse of 
the 1/./M’ makes higher accuracy extremely costly. To reduce the error to, say, 
1074 would take of the order of 108 samples, and to reduce it to 10~° would take 
of the order of 10!? samples; see Exercise 15.4. © 


15.3 Monte Carlo for option valuation 


We are now in a position to use Monte Carlo for option valuation. We consider 
a European-style option with payoff that is some function A of the asset price at 
expiry. Our model for the asset price at expiry is (6.8) with t = T. Using the risk 
neutrality approach discussed in Chapter 12, the time-zero option value can be 
found by setting u = r and computing e E(A(S(T))). 

Putting all this together, we wish to find the expected value of the random vari- 
able 


; 1 
eTA (so exp (> — 5e?) T+ avTz|) ; where Z ~ N(O, 1). (15.6) 


The resulting Monte Carlo algorithm can be summarized as follows: 


for i= 1 to M 
compute an N(0, 1) sample &; 
set S; = Spel 29 T +0 VTE 
set V; =e"? A(S;) 

end 

set ay = (a7) ye V; 

set by = (a) LAV - ay} 


The output provides an approximate option price ay and an approximate 95% 
confidence interval (15.5). 
Computational example We now use the Monte Carlo method to value a 
European call option, so A(S(T)) = max(S(T) — E,0). We will use the 
Black-Scholes formula (8.19) to compute the exact value and then see how 
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Fig. 15.2. Monte Carlo approximations to a European call option value. Crosses 
are the approximations, vertical lines give computed 95% confidence intervals. 
Horizontal dashed line is at height given by the Black-Scholes formula. 


well Monte Carlo performs. We take So = 10, E = 9, ø = 0.1, r = 0.06 and 
T = 1. The Black-Scholes option value is 1.5429. Figure 15.2 shows the Monte 
Carlo results, in a similar manner to Figure 15.1. We used sample sizes M = 
2°,2°,...,2!7. For each sample size we plot the computed mean ay with a 
cross and show the computed 95% confidence interval as a vertical line. The 
same pseudo-random number sequences as those for Figure 15.1 were used, and 
once again the M = 2” confidence interval does not contain the true mean. With 
the largest sample size, 2!7, the error in ay was © 1.2 x 1073. We emphasize 
that in this example there is no need to apply Monte Carlo as the Black-Scholes 
formula gives the exact solution conveniently. However, as we will see in Chap- 
ter 19, Monte Carlo comes into its own in more complicated circumstances where 
no simple Black—Scholes-type formula is available. © 


15.4 Monte Carlo for Greeks 


In addition to the option value, we know that the Greeks — the partial derivatives 
of the option value with respect to various quantities — are also of interest. In 
particular, the delta, A := 0V/dS, plays a key role in the hedging strategy that a 
trader must operate in order to replicate the option. The Monte Carlo approach can 
be used to compute approximate partial derivatives; although it must be handled 
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with care and may prove expensive. We focus here on the case of computing the 
time-zero delta of a European-style option with payoff A(S(7)), but the principles 
apply generally. 

A simple Taylor series expansion shows that the delta at asset price S and time 
t satisfies 


aV(S,t)  VS+h,t) -VG,2) 
as h 


+ O(h), ash —> 0. (15.7) 


Hence, we may choose a small value of h and use the finite difference approxima- 
tion 


aV(S,t) V(S+A,t)—V(S,t) 


~ 


as h 


This produces an approximation to delta at a single point based on option values at 
two points with slightly different S arguments. Using the risk-neutral, discounted, 
expected payoff formulation, we could thus approximate the time-zero delta by 
computing Monte Carlo estimates of the two expected values in 


_,-p E(A(S(T)), with S(O) = So) — E(A(S(T)), with S(0) = So + h) 
h ‘ 


e 


(15.8) 
Now the error in each of the two Monte Carlo estimates is O(1/ ./M) and hence, 
after dividing the difference by h, we expect an overall error of O(1/(hV/M)) 
for (15.8). This is unfortunate: we want to make h small to get a good derivative 
approximation in (15.7), but doing so forces us to take even more samples than 
the basic Monte Carlo option value strategy would need. Another way to view the 
difficulty is to note that in order to satisfy ourselves that we have even got the 
correct sign for the delta, we might ask for non-overlapping confidence intervals 
from the two Monte Carlo approximations. Since the exact means V (S,¢) and 
V(S +h, t) differ by O(h), this requires confidence interval widths that are at 
least as small as O(h). 

However, we can claw back some accuracy by noting that the two random vari- 
ables in (15.8) are highly correlated: for any particular asset path the payoff start- 
ing from S(O) = So is likely to be close to the payoff starting from So + A. In- 
tuitively, the corresponding sample mean errors should be close provided that we 
use the same paths for the two simulations; that is we apply Monte Carlo to the 
equivalent problem 


_-rEL(A(S(T)), with S(0) = So) — (A(S(T)), with S(0) = So + h)] 
i , 


e 
(15.9) 
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which involves a single random variable. In Chapters 21 and 22 we make the idea 
of correlation more explicit, and Exercise 22.3 gives further justification for this 
argument. 

This leads us to the following Monte Carlo algorithm for approximating A. 


fori = 1 to M 
compute an N(0, 1) sample £; 
set S; = Spel 29 IT +0 VTE 
set S? = (Sp + hye" 30 )T +0V TE: 
set A; = e~"7 (A(S;) — A(S?))/h 
end 
set ay = 7 Pu Aj 
set b? = gh, (Ai — am)? 


This produces an approximate delta value ay and an approximate 95% confidence 
interval (15.5). 


Computational example Here we return to the European call option used for 
Figure 15.2 with the same sample sizes M. The Black-Scholes time-zero delta 
value is 0.9558. Figure 15.3 shows the corresponding delta approximations from 
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Fig. 15.3. Monte Carlo approximations to time-zero delta of a European call 
option. Crosses are the approximations, vertical lines give computed 95% confi- 
dence intervals. Horizontal dashed line is at height given by the Black-Scholes 
formula. 
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the algorithm above, in the style of Figures 15.1 and 15.2. We fixed h = 1074. 
For M = 2!" , the error in the sample was © 1.2 x 1074; smaller than that for the 
corresponding Monte Carlo option value approximation. We also experimented 
with the corresponding algorithm that uses different pseudo-random numbers for 
the two options. The results were much worse; for the M values used here, no 
digits of accuracy were recorded, and the standard deviations were around 50 000 
times larger. © 


15.5 Notes and references 


There are many texts that discuss general Monte Carlo simulation. A ‘golden oldie’ 
that is still highly relevant is (Hammersley and Handscombe, 1964), whilst a short 
and very accessible modern perspective is given by (Madras, 2002). Monte Carlo, 
pseudo-random number generation and other simulation issues are treated in detail 
in (Ripley, 1987). 

Boyle’s classic 1977 paper (Boyle, 1977), which won the Journal of Financial 
Economics’ All-Star Paper Award 2002, introduced Monte Carlo for option val- 
uation. The paper (Boyle et al., 1997) summarizes developments since then, and 
in particular, has a detailed treatment of the Greeks. Texts that cover Monte Carlo 
for finance in some depth include (Clewlow and Strickland, 1998; Jackel, 2002; 
Kwork, 1998) 


EXERCISES 


15.1. x Show that ay in (15.1) is an unbiased estimator of E(X); that is, E(ay) = 
a. 
15.2. xxx» Show that 


A2 1 M 2 
b y := m% — am) 
satisfies 


6) = (EF) b?. (15.10) 


This confirms that bu? is not an unbiased estimator of var(X). Conclude 
from (15.10) that bm” in (15.2) is an unbiased estimator of var(X). 

15.3. xx Show that if Z ~ N(0, 1) then E(e%) = ye. [Hint: recall (3.8).] 

15.4. xx For the computational experiment that produced Figure 15.1, it was pre- 
dicted that ‘To reduce the error to, say, 1074, would take of the order of 108 
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%CH15 Program for Chapter 15 
% 
% Monte Carlo for a European put 


randn(‘state’,100) 


APPP” APAPAP” Problem and method parameters %%%%%%%%% 
S = 4; E = 5; sigma = 0.3; r = 0.04; T = 1; 

Dt = 1e-3; N = T/Dt; M = 1e4; 

Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo ooo P A P P P PA 


V = zeros(M,1); 

fori=1:M 
Sfinal = S*exp((r-0.5*sigma’2)*T+sigma*sqrt(T)*randn); 
V(i) = exp(-r*T)*max(E-Sfinal,0); 

end 

aM = mean(V); bM = std(V); 

conf = [aM - 1.96*bM/sqrt(M), aM + 1.96*bM/sqrt(M)] 


Fig. 15.4. Program of Chapter 15: ch15.m. 


samples, and to reduce it to 107° would take of the order of 101? samples.’ 
Where do these figures come from? For the computations in Figure 15.2, 
roughly how many samples would be needed to reduce the error to 107°? 


15.6 Program of Chapter 15 and walkthrough 


In ch15, listed in Figure 15.4, we use Monte Carlo to value a European put. The code follows 
the algorithm in Section 15.3, making use of MATLAB’s built-in functions mean and std, which, 
respectively, compute the sample mean (15.1) and sample standard deviation — the square root of the 
sample variance (15.2). 

The code produces a confidence interval conf = [1.0070, 1.0402]. Checking with the 
Black-Scholes formula from ch08 gives 


>> [C, Cdelta, P, Pdelta] = ch08(4,5,0.04,0.3,1) 


C = 0.2167 
Cdelta = 0.3226 
P = 1.0207 


Pdelta = -0.6774 


PROGRAMMING EXERCISES 


P15.1. Adapt ch15 to produce a picture like that in Figure 15.2. 
P15.2. Adapt ch15 to produce an estimate of the delta. 


150 Monte Carlo method 


Quotes 


To know the vintage and quality of a wine 
one need not drink the whole cask. 
OSCAR WILDE, 1854-1900. 


In the classical theory, which we are discussing here, 
the unknown parameter p is a number, not a random variable, 
so p is either in J or outside it, 
and it is meaningless to speak of the probability of p lying in J 
(the Bayesians, on the other hand, consider p a random variable — see Section 15.7). 
The expression 95% confidence interval refers to the 
procedure through which 7 was produced. 
This procedure produces intervals containing p 95 percent of the time. 
RICHARD ISAAC (Isaac, 1995) 


The Central Limit Theorem is a powerful tool, 
and we wish we had an intuitive explanation of why it should be true. 
Unfortunately, we don’t. 
MARK DENNEY AND STEVEN GAINES (Denney and Gaines, 2000) 
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Binomial method 


OUTLINE 


e description of the binomial method 
e derivation of the parameters 
e computational results 


16.1 Motivation 


We now introduce another computational approach. The binomial method is 
straightforward to describe and implement, and, as we will see in Chapters 18 
and 19, has the advantage that it is readily adapted to a range of non-European 
options for which no analytical formula is available. In particular, the bino- 
mial method provides the simplest means to value American options. In study- 
ing the method, we revisit two ideas, discrete asset price models and risk 
neutrality. 


16.2 Method 


The binomial method uses a simple discrete model for the asset price movement. 
We let ôt = T/M denote the spacing between successive time points, where T 
is the expiry date. So asset prices will be considered at times t; = iôt, for 0 < 
i < M. A key assumption in the binomial method is that between successive time 
levels the asset price moves either up by a factor u or down by a factor d. An 
upward movement occurs with probability p and a downward movement occurs 
with probability 1 — p. This scenario can be regarded as a simplified version of 
the discrete model introduced in Chapter 6. Indeed, Exercise 16.1 asks you to cast 
this simple model in the form (6.2) by redefining Y;. 

Since the initial asset price, So, is known, at time ft; = ôt the possible asset 
prices are uso and dSo. Similarly, at time f2 = 2ôt the possible asset prices are 
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Fig. 16.1. Recombining binary tree of asset prices. 


u?So, udSo and d?So. (The price udSo may arise from an upward movement 
followed by a downward movement or from a downward movement followed by 
an upward movement.) In general, at time t = t; := iôt there are i + 1 possible 
asset prices, which we label 


Si=d'"u"So, O<n<i. (16.1) 


Hence, at the expiry time t = ty = T there are M + 1 possible asset prices. The 
values si for 0 <n <i and 0 <i <M form a recombining binary tree, as illus- 
trated in Figure 16.1. 

For a European-style call option, the payoff at expiry has the form A(S(T)). 
Hence, if the asset has price Ned at time t = ty = T then the value of the option 
at that time is A (SM). Generally, we let V} denote the value of the option at time 
t = ti corresponding to asset price S1. We thus know that 


VM = A(S¥), O<n<M. (16.2) 
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Our task is to find Vo , the option value at time zero. We may do this by working 
backwards through the tree. Suppose Wit are known; that is, we have the 
option values corresponding to time f = ¢;+; and all possible asset prices. Then 
consider the option value V} corresponding to asset price S!, at time t = t;. Because 
of our up/down assumption about the asset price movement, working from right to 
left, the asset price Si comes either from S with probability p, or from S IE 
with probability 1 — p. Now, recall the definition (3.1) for the expected value of 
a discrete random variable. The big idea in the binomial method is to multiply 
the two possible values vg and vi +I by their associated probabilities to get an 
expected value. In this way the option value V, corresponding to asset price $, is 
taken to be IVS +d - pve, scaled by the appropriate factor that allows for 
the interest rate, r. This gives the fundamental relation 


yicem Gn +a - p)vi*') o  O<n<i, O<i<M-1. 
(16.3) 
Once the parameters u, d, p and M have been chosen, the formulas (16.1)— 
(16.3) completely specify the binomial method. The recurrence (16.1) shows how 
to insert the asset prices in the binomial tree. Having obtained the asset prices at 
time t = tm = T, (16.2) gives the corresponding option values at that time. The 
relation (16.3) may then be used to step backwards through the tree until V, the 
option value at time t£ = fo = 0, is computed. 


16.3 Deriving the parameters 


Since the discrete asset price model in the binomial method fits into the framework 
of (6.2), by appealing to Exercise 6.2 we could tune the parameters by asking for 
the corresponding Y; to have zero mean and unit variance. This would lead to 
two constraints. However, to give more insight into the workings of the method, 
we will derive those constraints from first principles. Exercise 16.5 asks you to 
confirm that the two approaches lead to the same conclusion. 

As a means to write down an expression for the up/down asset price model used 
in the binomial method, we define a random variable R; such that R; = 1 if the 
asset price goes up from time (i — 1)é¢ to iôt and R; = 0 if the asset price goes 
down. Hence, R; = 1 with probability p and R; = 0 with probability 1 — p. This 
means that R; is a Bernoulli random variable with parameter p, so from (3.2) and 
(3.14) we see that E(R;) = p and var(R;) = p(1 — p). After n time increments 
the asset has undergone }`;_;] R; upward movements and n — }>/_, R; downward 
movements. So the asset price S(ndr) at time t = nôt is given by 


S(ndt) = Sou2i=1 Rig n=}; Ri 
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We may re-arrange this to 


S(nét) _ gy? Or Ri 
So d ` 
Taking logs gives 


S(nôt) u 
log ( % ) = nlogd + log (=) = Ri. (16.4) 
Now, by the Central Limit Theorem, for large n the sum )~;_, R; behaves like 
a normal random variable. Hence, for large n, log(S(ndt)/So) will be close to 
normal. To match the continuous asset price model (6.8) used in the Black-Scholes 
analysis, we thus require the mean of log($(ndt)/So) to be (u — 50°)nst and the 
variance to be o2nét. Further, as the binomial method works with expected values, 


we impose the risk neutrality assumption u = r. This leads to the conditions 


plogu+(1—p)logd = (r — 40°) 8t, (16.5) 


u | ôt 


see Exercise 16.2. Regarding ôt = T/M as pre-specified, we now have two equa- 
tions in the three unknowns, p, u and d. In general, we can fix one of the three 
and solve for the other two. To pick out a particular solution this way, we may set 
p= 5 and solve to find that 


pee, «eg ONT ge ee, (16.7) 


see Exercise 16.3. 


16.4 Binomial method in practice 


The arguments in the previous section suggest that the binomial method asset 
model matches that used in the Black-Scholes analysis for small ôt, that is, large 
M. We may thus hope that the option values computed from the binomial method 
agree well with those from the Black-Scholes formulas, and that the agreement 
improves if M is increased. 


Computational example We use the binomial method to value a European 
put with So = 9, E = 10, T = 3, r = 0.06 and o = 0.3. Table 16.1 shows the 
results for M = 100, M = 200 and M = 400, along with the Black—Scholes 
value 1.4728. Our first observation is that with all three choices of M the bi- 
nomial method approximation is correct to at least two decimal places. The 
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Table 16.1. European put value 
approximations from binomial method 


Option value 


M = 100 1.4716 
M = 200 1.4762 
M = 400 1.4726 
Black—Scholes 1.4728 
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Fig. 16.2. Convergence of the binomial method for a European put as the num- 
ber of time points, M, increases. Upper picture: M goes from 20 to 250 in steps 
of 5. Dashed line is ‘exact’ solution. Lower picture: M goes from 200 to 400 in 


steps of 1. 


most accurate approximation of the three comes from the largest value of M, 
which is intuitively reasonable. However, it is perhaps surprising that M = 200 
gives less accuracy that M = 100. To check whether this is simply a quirk, 
the upper picture in Figure 16.2 shows the computed option value for M = 
20, 25, 30,..., 250, with the Black-Scholes value superimposed as a dashed 
line. We see that although the binomial method approximations do appear to 
converge as M increases, the convergence is by no means monotonic — tak- 
ing a slightly bigger M may worsen the error — and there is a general ‘saw- 
tooth’ pattern to the sequence of approximations as M increases. The lower plot 
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emphasizes the waviness. Here we have plotted the computed solution for all M 
between 200 and 400. The result appears to oscillate between two smooth curves, 
neither of which approaches the correct answer monotonically. © 


Two features stand out in Figure 16.2. 


(i) The binomial method approximation converges to the Black-Scholes value as 
M > œ. 
(ii) The convergence is not monotonic. 


These may be shown to be generic. Moreover, it is possible to describe the rate at 
which convergence takes place. Letting em = Ive — P (Sọ, 0)| denote the error in 
the binomial method approximation with ôt = T /M, it can be shown that there is 
a constant K such that 


eu < (16.8) 


=| 


In the upper picture of Figure 16.3 we display the errors in the example above for 
M between 100 and 400. The points have been joined by straight lines for clarity. 
The curve 1/M is added as a solid line, and we see that (16.8) appears to hold with 
K = 1. Taking logs in (16.8) gives log ey < log K — log M, showing that the log 
of the error as a function of log M should lie below a straight line of slope —1. 
The lower picture of Figure 16.3 re-scales the axes logarithmically to confirm this 
behaviour. 


16.5 Notes and references 


Cox, Ross and Rubinstein (Cox et al., 1979) wrote the original binomial method 
paper. Since then numerous authors have analysed and extended the ideas. 

It is possible to derive the parameters u, d and p from a number of different 
viewpoints. For example, with p = 5 the choice 


ue (1 + y e075t — 1) e d =e (1 — „erst — 1) (16.9) 


is common; see (Kwok, 1998; Wilmott et al., 1995). Exercise 16.4 shows that this 
is very close to the choice (16.7) for small ôt. 

Although much literature has been devoted to establishing that the error in vari- 
ous classes of binomial methods tends to zero as M — ov, surprisingly little atten- 
tion was initially paid to the rate of convergence. Leisen and Reimer (Leisen and 
Reimer, 1996) developed a general convergence rate theory, and the bound (16.8) 
follows from their results. A more detailed analysis, with explicit error constants, 
appears in (Walsh, 2003). 
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Fig. 16.3. Upper picture: Error in the binomial method for a European put as the 
number of time points, M, increases from 100 to 400. Solid line is 1/M. Lower 
picture: same data on a log-log scale. 


The odd—even ripples in the error, as depicted in Figures 16.2 and 16.3, have 
been widely reported. The references (Leisen and Reimer, 1996; Rogers and Sta- 
pleton, 1998) give explanations for the effect and propose fixes. 

Applying the binomial method may be shown to be equivalent to using a finite 
difference method to approximate the Black-Scholes PDE, a point that we pursue 
in Section 24.4. This is one means of proving that the binomial method solution 
converges to the Black-Scholes value as M — oo, see (Kwok, 1998), for example, 
and numerical analysis insights can also be used to explain the odd-even ripples. 

The book (Clewlow and Strickland, 1998) covers a number of practical issues 
in the implementation of the binomial method, and provides pseudo-code listings. 

A case study with the aim of making the binomial method run as quickly as 
possible in MATLAB is given in (Higham, 2002), along with downloadable codes. 

It is possible to compute Greeks via the binomial method. For partial derivatives 
with respect to S or t, approximations can be obtained using information from the 
tree. Exercise 16.8 illustrates the idea. Other partial derivatives can be treated by 
re-running the method with perturbed data, in the manner outlined in Section 15.4. 
Further details can be found in (Hull, 2000), for example, and (Walsh, 2003) shows 
that delta can be approximated to the same order of accuracy as the option value. 
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Binomial method 


EXERCISES 
x Consider the discrete asset price model used in the binomial method. 
Show that it may be written in the form (6.2) if we let Y; be defined as 
u—l—uôt : ET 
, with probability p, 
L k (16.10) 
r with probability 1 — p. 


xx Starting from (16.4) show that 


E [oe Gal = nlogd + log (5) np 
var [iog (=) | = (ioe ($) npc = p». 


Hence, obtain (16.5)—(16.6). 

xx Show that setting p = 5 in (16.5)-(16.6) produces (16.7). 

x x x For the parameters u and d in (16.7) show that 

u=1+ovV65t +rét + O(5t7/7), d=1-—oV6t+rét + O(5t?/"), 


as ôt — 0. 


and 


Show also that the corresponding u and d parameters in (16.9) have the 
same expansions up to O (8t?/?). [Hint: recall that V1 + x = 1+ 5x + 
O(x*) and eX =14+x+ 5x? + O(x3) asx > 0.] 

xx We know from Exercise 6.2 that if Y; in (16.10) has zero mean and unit 
variance, we recover the continuous asset price model in the limit ôt — 0. 
Set u =r and p= 5 and show that requiring E(Y;) = 0 and var(Y;) = 1 
in (16.10) leads to 


u=1+ovét+rét, d=1-—oVst+rét. 


Note that these values agree with those in Exercise 16.4 up to O(6t?/*). 
x x x Returning to the recurrence (16.3) we see that for M = 1 


Vo ape (pv! i 0 Soe pV ) , 
and for M = 2 
Vo =e" (pV, +0 — p)Vo) 
=e! (pe (pV? + 0 — p)V>) + — pje ™™ (pV; 
+a- p)Vo)) 
— post (p°v3 +2p(1— pV; +d - py Vo). 
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Similarly for M = 3 we find that 
Vise (pv? +3p° 0 — p)V3 +3p(1 — pV? + (0 — pve) . 


The coefficients {1, 1}, {1, 2, 1}, {1, 3,3, 1} are familiar from Pascal’s tri- 
angle. Having spotted this connection, prove by induction that 


M IM 
"EOD o) a- pve, (16.11) 
k=0 


where (z) denotes the binomial coefficient, 


M\ M! 
@) ~ kM — 1)! 


16.7. xx Letting 


write down the form of the i by (i + 1) matrix BË such that WË = Biw't!, 
16.8. x» Explain why the ratio (vi — V$) /(S i — SÈ) can be regarded as an ap- 
proximation to the time-zero delta. 


16.6 Program of Chapter 16 and walkthrough 


The program ch16 implements the binomial method for a European call. It is listed in Figure 16.4. 
First, parameters are initialized, using (16.7) for u and d. The quantity 
S*d.°7([M:-1:0]’).*u.*((0:M]’) 
is an M+1 by 1 array whose components cover the values sy S M NaN v in the expiry-time level 
of the asset price tree in Figure 16.1. Hence, the line 
W = max(S*d.7([M:-1:0]’).*u.*(([0:M]’)-E,0); 

contains the expiry time option values, as in (16.2). We then work through the iteration (16.3) by 
exploiting MATLAB’s colon notation to extract subarrays. The syntax 


exp(-r*dt)*(p*W(2:it+1) + (1-p)*W(1:i)); 
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%CH16 Program for Chapter 16 
% 
% Implements binomial method for European call 


VWVVVV%% Problem and method parameters %%%%%%%%%%% 
S =3;E = 2; T = 1; r = 0.05; sigma = 0.3; 

M = 400; dt = T/M; p =0.5; 

u = exp(sigma*sqrt(dt) + (r-0.5*sigma^2)*dt); 

d = exp(-sigma*sqrt(dt) + (r-0.5*sigma^2)*dt); 

Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo YoYo Po P P A% P P PP% 


% Time T option values 
W = max(S*d.^([M:-1:0]').*u.^([0:M]")-E,0); 


% Work back to option value at time zero 


for i = M:-1:1 
W = exp(-r*dt)*(p*W(2:14+1) + (1-p)*W(1:i)); 
end 


disp(’Option value is’), disp(W) 
Fig. 16.4. Program of Chapter 16: ch16.m. 


represents 


Wi+1 Wi 


The line for i = M:-1:1 sets up a loop that is repeated M times; first with i = M, then with i = 
M-1, and so on, down to i = 1. With this set-up, the dimension of W decreases by one each time 
around the loop. On exit, W is a scalar, whose value is ve 7 

Running ch16 .m produces the value 1.1175. To check, we may call ch08. 


>> [C, Cdelta, P, Pdelta] = ch08(3,2,0.05,0.3,1) 


C = 1.1175 
Cdelta = 0.9524 
P = 0.0200 


Pdelta = -0.0476 


PROGRAMMING EXERCISES 


P16.1. Alter ch16 so that the choice (16.9) for u and d is used. 
P16.2. Implement the binomial method via the formula (16.11). 
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Quotes 


“Would you tell me, please, which way I ought to go from here?’ 
“That depends a good deal on where you want to get to,’ said the Cat. 
‘I don’t much care where...’ said Alice. 
“Then it doesn’t matter which way you go,’ said the Cat. 
LEWIS CARROLL, Alice in Wonderland 


Sir, In your otherwise beautiful poem (The Vision of Sin) 
there is a verse which reads 
‘Every moment dies a man, 
every moment one is born.’ 
Obviously, this cannot be true and I suggest that in the next edition you have it read 
‘Every moment dies a man, 
every moment lis is born.’ 
Even this value is slightly in error 
but should be sufficiently accurate for poetry. 
CHARLES BABBAGE (in a letter to Lord Tennyson), source (Fröberg, 1985) 


In the literature, 
there are numerous contributions with limit proofs to European type options. 
Astonishingly, however, the convergence speed of binomially computed option prices 
has, so far, rarely been examined technically. 
Here, we present a theorem ... 

DIETMAR LEISEN AND MATTHIAS REIMER (Leisen and Reimer, 1996) 
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Cash-or-nothing options 


OUTLINE 


e cash-or-nothing call and put options 
e Black-Scholes formulas 

e Greeks 

e behaviour of delta 

e risk-neutral valuation 


17.1 Motivation 


We now take our first step away from vanilla Europeans and look at cash-or- 
nothing call and put options. There are three good reasons to look at these options. 


e They are widely traded, and hence of practical importance. 
e The corresponding Black-Scholes values can be found analytically. 
e They give us another opportunity to investigate the risk neutrality idea. 


17.2 Cash-or-nothing options 


A cash-or-nothing call option differs from a European call option in that the payoff 
at expiry is 


A, if S(T) > E, and 
0, if S(T) < E, 


where A > 0 is fixed. Holding this option amounts to making a straight bet that 
the terminal asset price will exceed the exercise price, E, that is, the European 
call will finish in-the-money. Winning the bet gets you A, losing the bet gets you 
nothing. Unlike the European case, there is no added value to be had from the asset 
exceeding the strike by a wide margin; the upside is limited to A. 

We have not yet specified the payoff for the case S(T) = E. This is an excep- 
tional event, technically it occurs with zero probability, so the resulting payoff is 
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Cash-or-nothing call 
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Fig. 17.1. Payoff diagrams for cash-or-nothing call and put. 


not important. But to be consistent with the formula that we derive, we will assume 
that A/2 is paid off in this at-the-money scenario, S(T) = E. 

Analogously, a cash-or-nothing put option differs from a European put option 
in that the payoff at expiry is 


0, if S(T) > E, and 
A, if S(T) < E. 


Holding this option amounts to making a straight bet that the European put will 
finish in-the-money. As for the call, we assume that A/2 is paid off if S(T) = E. 
Cash-or-nothing call and put payoff diagrams are shown in Figure 17.1. 

Cash-or-nothing options are sometimes called binary, or digital options, al- 
though these phrases are also used more generally when there is a discontinuous 
payoff diagram. 


17.3 Black-Scholes for cash-or-nothing options 


We will let C°*"(S, t) and P°*(§, t) denote the values of the cash-or-nothing 
call and put options, respectively, for asset price S and time t. 

The hedging argument used in Chapter 8 is very general — it requires only that 
the option value is a smooth function of S$ and t. Hence, we may ask for C“*"(S, t) 
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and P°h(S, t) to satisfy the Black-Scholes PDE (8.15). Specifying appropriate 
final time and boundary conditions is then sufficient to characterize the valuation 
formulas. 

There is a simple put—call parity relation connecting C°5"(S, t) and P(S, t), 
see Exercise 17.1, and hence we will focus on finding a formula for C°5"(S, t). 
The cash-or-nothing call payoff function gives final time conditions 


A, for S > E, 
lim Cers, t) = į A/2, for S = E, (17.1) 
R 0, for S < E. 


When S = 0, the asset remains at zero for all later times and hence the payoff is 
zero. This gives the boundary condition 


Ch, t) = 0, for allO <t <T. (17.2) 


When S is very large, the option is almost certain to pay off the amount A. So, 
after discounting for interest, we find that 


CNS: th ® Ae TTD, for large S. (17.3) 


Just as for the European case, imposing the final time and boundary conditions 
is enough to specify a unique solution. The solution turns out to have the simple 
form 


cens, t) = Ae ™ TDN (do), (17.4) 


where d> is the quantity (8.21) that appears in the European formulas. Our ap- 
proach for confirming that (17.4) is an appropriate solution will be to check that the 
formula satisfies the Black-Scholes PDE and the extra conditions (17.1)—(17.3). 
Exercise 17.2 asks you to do the latter. 

It is a straightforward exercise in differentiation to show that the partial deriva- 
tives appearing in the Black-Scholes PDE have the following form: 


aceash 2 Ae ™T-ÐN' (dz) 


delta); 17.5 
aS oS /T —t ( ) ( ) 
a2ccash Ae" —9 di N' (dəz) 
A 52 =— ISIT D (gamma); (17.6) 
accash 
Aa Are ™T-ÐN (dy) 
dı r 


+Ae7T-9 N' (dy) ( ) (theta); (17.7) 


WES) oTi 
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t 


Fig. 17.2. Black-Scholes surface for a cash-or-nothing call, with asset path 
superimposed. 


see Exercise 17.3. Inserting these expressions into the Black-Scholes PDE (8.15) 
we find that the expression 


accash eas a2c cash ac cash 
S ay = “cash 
eo 2 ae a 
takes the form 
_(T— ye dı r 
Are T—) N (dy) + Ae” TON (d ( - 
X ce NIG): dynes 


Ae TE —%VdiN'(da) Are "TÐ N’ (dp) 
2 (T-t) oȚvT =t 
which cancels down to zero, as required. 

Figure 17.2 gives a plot of the surface C°*"(S, t) and shows the option values 
mapped out by an asset path, in the style of Figure 11.3. For that path, the as- 
set is close to being at-the-money near expiry, and we see that the value changes 
dramatically as S(t) crosses the strike price £. 


Are” TDN (də), 


17.4 Delta behaviour 


The delta of an option is of special interest as it plays a key role in our hedging 
strategy. We see from (17.5) that the call delta is always positive. This behaviour 
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was also observed for European call options, and it can be explained in the same 
way — a positive payoff becomes more likely if the asset price increases. 
The behaviour of the delta at expiry can be summarized as follows: 


eash 0, forS > E, 
lim 5 = 10, forS=E, (17.8) 
0, for S < E, 


see Exercise 17.5. Recall that the delta is precisely the amount of asset that we hold 
in our replicating portfolio. For the in-the-money case, § > E, as the time to expiry 
shrinks the payoff is increasingly certain to be the constant A. Since there is no risk 
to eliminate, we should be holding a zero amount of asset. Similarly, if the option is 
out-of-the-money, S < E, as we approach expiry, the payoff is increasingly certain 
to be the constant 0, which is also riskless. The infinite at-the-money delta can be 
thought of as a consequence of the impossibility of hedging at a point where the 
payoff is discontinuous. Although expiring precisely at-the-money is a probability- 
zero event, the delta will be large if S(t) ~ E as expiry approaches. The practical 
consequences of a large delta are, of course, quite serious. For example, a large 
amount of cash needs to be withdrawn to maintain the delta-hedged portfolio and, 
ultimately, it will be impossible to purchase the necessary amount of asset — there 
will only be a finite supply available. This underlines the gap between theory and 
practice. 

We also note that the delta behaviour summarized in (17.8) is consistent with 
Figure 17.2. As we approach expiry the surface starts to look like two flat, hori- 
zontal sheets joined by a vertical sheet. 

In Figure 17.3 we plot the delta surface, as defined in (17.5). We chopped off 
the large heights that arise around S$ = E near expiry. A path is superimposed. To 
emphasize that large deltas can arise, we chose an asset that stumbles towards the 
strike price E near expiry. The ‘near infinite’ deltas close to expiry are too much 
for the plotter to handle. 


17.5 Risk neutrality for cash-or-nothing options 


We saw in Chapter 12 that there is a way to derive the Black-Scholes value for a 
European-style option that does not make direct use of the no arbitrage principle 
or the concept of hedging. Instead we impose the risk neutrality assumption u = r 
and compute the expected payoff, appropriately discounted for interest. 

We confirm directly in this section that the idea works for a cash-or-nothing call 
option. 
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delta 


0 s 


t 


Fig. 17.3. Black-Scholes delta surface for a cash-or-nothing call, with asset path 
superimposed. 


The payoff function A(-) appearing in (12.4) now has the form 


A, for x > E, 
A(x) = 4 A/2, forx = E, 
0, forx < E. 


Since the value of an integral does not change if we alter the value of the integrand 
at a single point, we may redefine A (E) = A, so that 


202(T —t) 


(log(x/S)—(u— gae) 
dx 


CO 
e 2 —-OR (payoff from S, t) = ee) 
(pay ) E ox /2n(T —t) 
(17.9) 


Exercise 17.6 asks you to confirm that when u = r, this reduces to the Black- 
Scholes value Ae~"— N (dz) from (17.4). 


17.6 Notes and references 


The terms binary and digital are not used with complete consistency in the litera- 
ture. We have fixed on the unambiguous cash-or-nothing (and asset-or-nothing in 
Exercises 17.7 and 17.8) in line with (Hull, 2000; Nielsen, 1999). 


17.1. 


17.2. 
17.3. 
17.4. 


17.5. 


17.6. 


17.7. 


17.8. 


17.6 Notes and references 169 


EXERCISES 


x x x By considering a portfolio consisting of a cash-or-nothing put and a 
cash-or-nothing call with the same strike prices and expiry dates, derive the 
“‘cash-or-nothing put—call parity’ relation 


CAN. t) + PRS, t) = Ae TT’, (17.10) 


xx Show that C°S(§, t) in (17.4) satisfies (17.1), (17.2) and (17.3). 

x x x Differentiate (17.4) to establish (17.5), (17.6) and (17.7). 

x x x Using (17.4) and (17.10), show that the value of the cash-or-nothing 
put option is 


Pes, t) = Ae TTÀ (1 = WN @)): 


Confirm that this formula has the required behaviour at S$ = 0 and in the 
limits ft — T~ and S —> oo. Also, show that it solves the Black—Scholes 
PDE (8.15). 

xx Establish (17.8). (You may use without proof the fact that ee!/® —> oo 
as £ > 0.) 

x x x Use the change of variable 


__ | log(S/x) + 0 — 50°)(T — t) 
JE ONT -t 


to show that, with u =r, the integral in (17.9) takes the value 
Ae™(T-ÐN (d2), where dz is defined in (8.21). 

xxx An asset-or-nothing call option has payoff function A(S(T)) of the 
form 


x, forx > E, 
A(x) = 4x/2, forx = E, 
0, forx < E. 


Draw the payoff diagram. Show that the risk-neutral approach of setting 
u =r ine’?! (payoff from S, t) produces the value SN (d1) for this 
option, where dı is defined in (8.20). How would the analogous asset-or- 
nothing put option be defined, and what is its value? 

xx Show that holding a European call option is equivalent to holding an 
asset-or-nothing call option (see Exercise 17.7 above) and writing a cash- 
or-nothing call with A = E, for the same expiry date. Use this to give an- 
other way to value the asset-or-nothing call option in Exercise 17.7. 
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%CH17 Program for Chapter 17 
% 
% Draws Black-Scholes surface for cash-or-nothing call 


clf 


APA” A” Parameters VPHYLVMVMWWWVVVV% 
E=1; A=2; r=0.05; sigma=0.2; T=1; L=50; 
Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Vo Vo Yo Yo Vo Yo Yo Yo Yo 


Svals = linspace(0,2,L); 
tvals = linspace(0,T,L); 
C = zeros(L,L); 


fori=4:L 
S = Svals(i); 
for j = 1:L-1 
t = tvals(j); 
tau = T-tvals(j); 
d2 = (log(S/E) + (r - 0.5*sigma’2)*(tau))/(sigma*sqrt(tau)); 
N2 = 0.5*(1+erf(d2/sqrt(2))); 
C(i,j) = A*exp(-r*tau)*N2; 
end 
% value at expiry 
C(i,L) = 0.5*A*(1+sign(S-E)); 
end 


[Smat,tmat] = meshgrid(Svals,tvals); 
mesh(Smat,tmat,C) 
xlabel(’t’), ylabel(’S’), zlabel(’C(S,t)’) 


Fig. 17.4. Program of Chapter 17: ch17.m. 


17.7 Program of Chapter 17 and walkthrough 


In ch17, listed in Figure 17.4, we plot a Black-Scholes surface for a cash-or-nothing call, in the style 
of Figure 17.2. The code is similar to ch11, except that we implement the formula (17.4) directly, 
rather than calling a separate function. To avoid division by zero errors, we deal with expiry, that is, 
j = L, after the inner loop. 


PROGRAMMING EXERCISES 
P17.1. Alter the binomial method program ch16 to handle the case of an asset-or- 
nothing call. 


P17.2. Alter the discrete hedging program ch09 to illustrate the difficulties that 
can arise when a cash-or-nothing option is delta-hedged. 
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Quotes 


Markets go up; markets go down. 
There is nothing insightful or sage about that observation. 
Derivatives, like any other market positions, are subject to this market risk. 
But while a normal investment may glide along a geometric path 
in response to changing market conditions, 
derivatives have special features that create 
erratic behaviour or that accelerate or exaggerate the results. 
PHILIP MCBRIDE JOHNSON (Johnson, 1999) 


There are many ways to play the game, 
but really only two kinds of players. 
Speculators and hedgers. 
Those scalping profits from churning markets, 
and those seeking shelter from the storm. 
Ninety-seven per cent of the daily churn in the financial markets is generated by 
speculators. 
This is the official figure, published by the Merc, 
which defends speculators as necessary for keeping the markets 
‘deep’ and ‘liquid’. 
Speculators stir the pits and hedgers pump them with funds, 
and between the two of them one gets the feeding frenzy known as 
the world financial markets. 
THOMAS A. BASS (Bass, 1999) 
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OUTLINE 


e American call and put 


equivalence of European and American call 


Black-Scholes for American put 


binomial method for American options 
e optimal exercise boundary 
e Monte Carlo for American options 


18.1 Motivation 


We now look at American options. These are typically more common than Euro- 
peans. The significant new feature here is the early-exercise facility. For put op- 
tions, this complicates the Black-Scholes analysis, places analytic formulas out of 
reach, and puts a strain on computational methods. 


18.2 American call and put 


An American option is like a European option except that the holder may exercise 
at any time between the start date and the expiry date. 


Definition An American call option gives its holder the right (but not the obli- 
gation) to purchase from the writer a prescribed asset for a prescribed price at 
any time between the start date and a prescribed expiry date in the future. © 


Definition An American put option gives its holder the right (but not the obli- 
gation) to sell to the writer a prescribed asset for a prescribed price at any time 
between the start date and a prescribed expiry date in the future. © 


The holder of an American option is thus faced with the dilemma of deciding 
when, if at all, to exercise. If, at time ¢, the option is out-of-the-money then it 
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is clearly best not to exercise. However, if the option is in-the-money it may be 
beneficial to wait until a later time where the payoff might be even bigger. 
American options are more widely traded than their European counterparts. In 
many exchanges, the early-exercise feature is offered by default. It is thus impor- 
tant to know how much extra value, if any, this flexibility builds in. 
The following argument, which is similar to one used in Section 2.6, shows that 
it is never optimal to exercise an American call option before the expiry date. 


As usual, let S(t) denote the asset price at time ¢ and let E denote the exercise price. Sup- 
pose the holder wishes to exercise the option at some time tf < T. This is only worthwhile 
if S(t) > E, and it gives a payoff of S(t) — E at time t. Instead, the holder could sell the 
asset short at the market price at time t and then purchase the asset at time t = T by doing 
the most favourable of 


(a) exercising the option at t = T, and 
(b) buying at the market price at time T. 


With this strategy the holder has gained amount S(t) > E at time ¢ and paid out an amount 
less than or equal to E at time T. This is clearly better than gaining S(t) — E at time t. 


Since it is never optimal to exercise an American call option before the expiry 
date, an American call option must have the same value as a European call option. 
Exercise 18.1 asks you to reach this conclusion by an alternative route. 

As we will see shortly, the same is not true for put options. 


18.3 Black-Scholes for American options 


Our aim in this section is to show how the arguments in Chapter 8 that led to the 
Black-Scholes PDE can be adapted to cover an American put option. We write 
PA™(§, t) to denote the American put option value at asset price S and time t, and 
use A(S(t)) = max(E — S(t), 0) for the corresponding payoff function. 

Our first observation is that 


PANS t)> A(S), — forallO<t <T, S>O. (18.1) 


This follows from a simple arbitrage argument. If PA™(§,t) < A(S(t)) then an 
instantaneous profit can be made by purchasing the option and immediately exer- 
cising it. We know from Figure 11.1 that this inequality does not hold universally 
for a European put, so, for put options, the early-exercise feature does make a 
difference to the value. 

Now we return to the replicating portfolio idea of Chapter 8. We may repeat the 
arguments up to the point (8.13) where A(V — II), or in our case, A(PA™ — TI), 
is deemed to be riskless. We now try to take the next step, which gave the equality 
(8.14). 


18.3 Black-Scholes for American options 175 


Case 1: A(P4™ — TI) > r(P4™ — TI). Here, the combination P“™ — TI does better than 
cash in the bank. We argued that this could be exploited by buying P^™ — TI, that 
is, buying the option and selling TI (short selling the asset and loaning out the 
cash). 

Case 2: A(P4™ — TI) < r(P4™ — TI). Here, the combination P“™ — TI does worse than 
cash in the bank. We argued that this could be exploited by selling P4™ — TI, that 
is, selling the option and buying IT (buying the asset and borrowing the cash). 


Without the early exercise facility, the no arbitrage principle rules out both cases. 
With early exercise, however, the story changes. In Case 1, the arbitrageur buys the 
option and hence controls the exercise facility. This extra freedom can only help 
the arbitrageur and hence the arbitrage possibility persists. On the other hand, in 
Case 2 the putative arbitrageur sells the option, and is at the mercy of the early 
exercise facility. The arbitrageur may be exercised against at any time, and can no 
longer guarantee to beat the bank risklessly. 

Overall, for an American put, the no arbitrage principle rules out Case 1, but 
not Case 2, and we conclude that (8.15) changes to 
Am 2 pAm Am 
ai 29 ui trs =p ph" <0. (18.2) 
Note that (18.2) is a partial differential inequality. Now, at any point (S, t) it will 
be optimal to either (a) exercise, or (b) hold on to the option, and hence 


+ 50S 


for each S, t one of (18.1) and (18.2) is at equality. (18.3) 


The three components (18.1), (18.2) and (18.3) are the key features in the the- 
ory of American option valuation. Together they form what is known as a linear 
complementarity problem. 

At expiry, if the option is still held, its payoff matches the European, so we have 
the final time condition 


PA"(S T) = A(S(T)), for all $ > 0. (18.4) 


For S = 0, the asset always has price zero, so a payoff of E is assured. In this case 
it is optimal to exercise immediately. We may interpret this formally as a boundary 
condition of the form 


PA™(S,t) > E, as S > 0, forallO <t <T. (18.5) 


Similarly, if S is large, then the option is extremely unlikely to produce a positive 
payoff, so we have 


PPS t) — 0, as S > o, forallO <t <T. (18.6) 
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The mathematical problem defined by (18.1)—(18.6) is much more difficult than 
the Black-Scholes PDE that arose without the early exercise facility. In general, 
there is no closed form expression for P Am(S, t) and we must use numerical meth- 
ods to obtain approximate values. 


18.4 Binomial method for an American put 


It turns out that a straightforward adaptation of the binomial method can be used to 
value an American put. We recall from Chapter 16 that asset prices in the binomial 
model are determined by (16.1). If the put option is held until its expiry date then 
(16.2) applies. Now, working backwards through the tree, if the option is retained 
at time t = t; and asset price S n then the value vi is given by (16.3). However, 


exercising the option would produce A(SŻ). Hence, choosing the best of the two 
possibilities leads to the relation 


vi = max [acsi e7” (pvit +a =- pvi*!)], 
O<n<i, O<i<M-1. (18.7) 


All together, (16.1), (16.2) and (18.7) completely specify the binomial method for 
computing the time-zero option value ve. 


Computational example We now use the binomial method to value an 
American put with the same parameter values as those in Section 16.4, that is, 
So = 9, E = 10, T = 3,r = 0.06 and o = 0.3. Table 18.1 shows the results for 
M = 100, 200, 400 and 1000. If we regard the M = 1000 result as accurate 
then we see that, as in the European case (Table 16.1), the method appears to 


converge, but does so in a nonmonotonic manner. Figures 18.1 and 18.2 give 
the American versions of the binomial method computations displayed in Fig- 
ures 16.2 and 16.3. We see that a very similar convergence behaviour arises. 
Indeed, it can be shown that an error bound of the form (16.8) continues to 
hold. Ko 


Table 18.1. American put value 
approximations from binomial method 


Option value 


M = 100 1.7974 
M = 200 1.7983 
M = 400 1.7962 


M = 1000 1.7962 
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Fig. 18.1. Convergence of the binomial method for an American put as the num- 
ber of time points, M, increases. Upper picture: M from 20 to 250 in steps of 5. 
Dashed line is ‘exact’ solution. Lower picture: M from 200 to 400 in steps of 1. 


18.5 Optimal exercise boundary 


If S is large, since there would be no payoff, it cannot be worthwhile to exercise an 
American put; it is optimal to hold on to the option. On the other hand, in the limit 
S — 0, the payoff from exercising approaches the maximum possible value that 
we can attain; it is optimal to exercise. Interpolating between these two extremes, 
we might expect there to be a well-defined optimal exercise boundary, S*(t), such 
that 


e for S(t) < S*(t) it is optimal to exercise, so P4™(S, t) = A(S(t)), and 
e for S(t) > S*(t) it is optimal to hold, so PA™(S, t) > A(S(t)). 


Figure 18.3 shows the value P4™(§,t) as a function of S, for t fixed. We 
set E = 10, r = 0.06, o = 0.3 and T = 1, and considered t = T /4. We used the 
binomial method with a wide range of initial asset prices Sg to compute values of 
PA™(S, T /4). The figure shows that for small S the option value lies on the hockey 
stick A(S(t)), which is superimposed as a dashed line. For S bigger than some 
level S*(T /4), the value P4™(S, T /4) lies above the hockey stick. It can also be 
shown that the derivative 3 PA™(S*(t), t)/3S = —1, so at the point $*(t) the curve 
P4A™(§(t), t) leaves the hockey stick smoothly, with a matching first derivative. 
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Error in binomial method 
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Fig. 18.2. Upper picture: Error in the binomial method for an American put as 
the number of time points, M, increases from 100 to 400. Solid line is 1/M. 
Lower picture: same data on a log-log scale. 


American put value 


Fig. 18.3. Value P“™(S, T /4) for an American put, computed via the binomial 
method. Hockey-stick payoff function A(S) is superimposed as a dashed line. 
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12 


10 Do not exercise E 


4 Exercise 


Fig. 18.4. Exercise boundary for an American put. Computed via the binomial 
method. 


Exercise 18.2 asks you to go half-way towards proving this, by establishing —1 as 
a lower bound. 

In Figure 18.4 we explicitly compute the optimal exercise boundary S*(t) for 
the same E,r, o and T as used in Figure 18.3. The boundary is shown as a solid 
curve — below this curve it is optimal to exercise and above this curve it is op- 
timal to hold on. At t = T/4 we have S*(t) = 7.3, which agrees with the point 
on the horizontal axis in Figure 18.3 where P“™(S, T /4) leaves the hockey stick. 
We tracked the optimal exercise boundary by applying the binomial method with a 
range of initial asset prices, Sg. At each time point, t;, we defined S*(t;) to be 
the smallest value of 5i over all binomial trees for which the e~"* (pvit + 
(1 — pV ir) term in (18.7) dominated the A(S!) term. In other words, S*(t;) 
was taken to be the smallest Si for which the binomial method chose not to 
exercise. 

It can be shown that Figure 18.4 is generic in the sense that 


@) S(T) =E, 


(ii) S*(t) is a well-defined, single-valued function of t, 
(iii) $*(t) is a nondecreasing function of t. 


Exercise 18.3 deals with points (i) and (iii). 
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18.6 Monte Carlo for an American put 


We have seen that the binomial method has a natural extension from European to 
American options. The same is not true for the Monte Carlo method. This mis- 
match has two sources. 


(a) Monte Carlo deals with single paths, whereas the binomial method essentially averages 
over paths automatically. 
(b) Monte Carlo works forward in time, whereas the binomial method runs backwards. 


Monte Carlo for European options exploits the idea that the value can be ex- 
pressed as an expectation. In the American case there is an analogous, but less 
computationally useful, representation. Under the risk neutrality condition u =r, 
the time-zero American put value may be expressed as 

P^™(So, 0) = sup Efe’ A(S(z))], (18.8) 


0<t<T 


where t is a stopping time. To define a stopping time precisely requires technic- 
alities that have not been developed in this book, but the expression (18.8) can be 
described informally as follows. 


e The value taken by t determines the time at which the option is exercised. So 
e’* A(S(t)) in (18.8) represents the discounted payoff. 

e The quantity t is a random variable that depends upon the asset path S(t). 

e Any rule that specifies t as a function of the asset path S(t) can be used, with the proviso 
that the decision to set t = ¢* can only use information about S(t) for 0 < t < t*. 

e The option value P“™(So, 0) is given by using the rule for determining t that leads to 
the biggest expected payoff, suitably discounted for interest. 


Putting this in words: 


Imagine all possible exercise strategies, that is, all possible rules for determining when to 
exercise the option. Suppose we judge the success of a strategy by its discounted expected 
payoff. Then we recover the Black-Scholes American put option value if we use the best 
out of all those exercise strategies that do not look forward in time — those that take an 
exercise decision at each point in time using only information about the asset price up to 
that time. 


From a computational perspective, an enormous hurdle in (18.8) is the require- 
ment to optimize over all allowable exercise strategies. It is impossible to write 
down all such strategies in any useful way, let alone optimize over them! To 
illustrate the idea, we restrict ourselves to a very simple class of allowable ex- 
ercise strategies. Suppose we decide to exercise the option at time ¢ if the dis- 
counted payoff, e~"’ A(S(t)), exceeds some fixed level a > 0. If we reach the ex- 
piry date, T , and have not yet exercised the option, then it makes sense to exercise 
if A(S(T)) > 0. Overall, our exercise strategy may be written as follows. 
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Fig. 18.5. Asterisks are Monte Carlo approximations to the discounted expected 
American put payoff with a simple exercise strategy parametrized by a. Upper 
and lower horizontal lines show the true American and European values. 


e Exercise at time t if e [E — S(t)] > a. 
e If we reach T, exercise if E — S(T) > 0. 


This is an allowable strategy, as the decision about whether to exercise at time t 
uses only S(t). In Figure 18.5 we measure the success of this approach. Here we 
valued an American option with Sg = 9, E = 10, T = 1, r = 0.06 and o = 0.3. 
The Black-Scholes value, computed via the binomial method, was found to be 
1.43. The corresponding European put option value is 1.32. These values are in- 
dicated as horizontal lines. The asterisks in the figure show the Monte Carlo ap- 
proximations to the option value, using the exercise strategy above, with a range of 


choices for a. More precisely, we computed 10° risk-neutral discrete asset paths, 
with a time spacing of ôt = 1073, and applied the strategy at each discrete time 
point i ôt. Confidence intervals for the sample means were smaller than the size of 
the asterisks in the plot. We see from the figure that if œ is taken to be around 2.5, 
the discounted expected payoff is close to the Black-Scholes value. Exercise 18.4 
asks you to explain the results for 0 < a < 1 and « large. In this example, we 
are fortunate that optimizing over the parameter œ in our simple class of exercise 
strategies gives an answer that is close to the optimal over all allowable strate- 
gies. Of course, if we were to change So, E, T, r or o then the optimal œ would 
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certainly change, and there is no guarantee that it would give a good approximation 
to PA™ (So, 0). 

In general, picking any particular allowable strategy and computing the dis- 
counted expected payoff will lead to a lower bound on the true Black-Scholes 
value. 

By contrast, we could allow ourselves the luxury of peeking into the future in 
order to select the best possible exercise times. 


e Consider the whole path S(t) for 0<+<T, and exercise where e7"' A(S(t)) is 
maximized. 


For each asset path, this strategy does at least as well as the best allowable strategy. 
Hence, the corresponding discounted expected payoff gives an upper bound on the 
Black-Scholes value. In the example of Figure 18.5 the upper bound was 2.62, 
which, as is typical, is too crude to be of much use. 


18.7 Notes and references 


Our derivation of the linear complementarity problem (18.2)-(18.6) followed 
closely the treatment by Almgren (Almgren, 2002). It is possible to write the 
American put valuation problem in terms of a PDE that explicitly involves the op- 
timal exercise boundary, S*(t). This free boundary problem approach is described 
in (Kwok, 1998; Wilmott et al., 1995), for example. Kwok (Kwok, 1998) gives 
examples of more complex options with early exercise features for which the ex- 
ercise and non-exercise regions are made up of disconnected sets. The condition 
that aPA™(S*(t), t)/AS = —1, which we illustrated in Figure 18.3, is discussed in 
detail in (Kwok, 1998) and (Wilmott et al., 1995). 

Convergence of the binomial method for American options is treated in (Leisen, 
1998), where an error bound of the form (16.8) is derived. 

The argument in Section 18.2 that shows the equivalence of European and 
American call values fails to hold when the asset pays dividends, see (Hull, 
2000; Kwok, 1998; Wilmott et al., 1995), for example, for details of how the theory 
can be adapted. 

Applied mathematicians have recently become interested in the nature of the 
optimal exercise boundary for t ~ T. It can be shown that as the boundary S*(t) 
approaches E as t — T7, its tangent becomes unbounded, as may be observed in 
Figure 18.4. The precise nature of this singularity is explored in (Goodman and 
Ostrov, 2002; Kuske and Keller, 1998), for example. 

Bjork (Bjork, 1998) is a good source for the mathematics behind (18.8). 

Until quite recently, most researchers believed that a Monte Carlo approach 
could not be used for valuing American options. However, a number of authors 
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now argue that, with appropriate extensions, competitive Monte Carlo based com- 
putational algorithms are achievable; see (Anderson and Broadie, 2001; Boyle 
et al., 1997; Fu et al., 2001; Longstaff and Schwartz, 2001; Rogers, 2002), for 
example. 


EXERCISES 


18.1. xxx Repeat the analysis in Section 18.3 for the case of an American call 
option. Show that the Black-Scholes European call option formula (8.19) 
satisfies the relevant analogues of (18.2)-(18.6). Deduce that an American 
call option has the same value as the corresponding European call option. 

18.2. xx In Section 18.5 it was mentioned that 3 P^™(S*(t), t)/3S = —1. Give a 
simple explanation why 0 P4™(S*(t), t)/ðS cannot be less that —1. 

18.3. xxx» Given that there is a well-defined, single-valued optimal exercise 
boundary function $*(t) for an American put, show that S*(T) = E and 
that S*(t) is a nondecreasing function of t. 

18.4. x» Explain the behaviour of the Monte Carlo approximations in Figure 18.5 
for 0 < a < 1 anda large. 

18.5. xx Which of the following exercise strategies are allowable in (18.8)? 


Strategy 1: 


e Exercise at time t if S(t) < SE. 
e If we reach T, exercise if E — S(T) > 0. 


Strategy 2: 


e Exercise at time t if S(t) < min(£, 1.1 ming<;<7 S(r)). 
e If we reach T, exercise if E — S(T) > 0. 


Strategy 3: 


e Exercise at time ¢ if S(t) < min(E, 5 ming<r<r/2 S(r)). 
e If we reach T, exercise if E — S(T) > 0. 


18.8 Program of Chapter 18 and walkthrough 


In ch18, listed in Figure 18.6, we give a modified version of ch16 that values an American 
put with the binomial method. After initializing parameters, we create the one-dimensional 
array dpowers with entries d™,d™-!,...,d? and the one-dimensional array upowers with entries 
u?, ul, ehh u™ | It follows that Sxdpowers.*upowers gives the asset values ae SM: Hots aT at 


expiry in the asset price tree of Figure 16.1, and 


S*dpowers(M-i+2:M+1) .*upowers(1:i); 
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%CH18 Program for Chapter 18 
% 
% Implements binomial method for an American put. 


VVVV%% Problem and method parameters %H%%%%%%%% 
$=3;E=4;T=1; r = 0.05; sigma = 0.3; 

M = 400; dt = T/M; p =0.5; 

u = exp(sigma*sqrt(dt) + (r-0.5*sigma‘2)* dt); 

d = exp(-sigma*sqrt(dt) + (r-0.5*sigma’2)* dt); 

Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo P P P P P PA 


% Initial computations 
dpowers = d.^([M:-1:0]'); 
upowers = u.^([0:M]"); 


% Time T option values 
W = max(E-S*dpowers.*upowers,0); 


% Work back to option value at time zero 
for i = M:-1:1 

Si = S*dpowers(M-i+2:M+1).*upowers(1:i); 

W = max(max(E-Si,0),exp(-r* dt)*(p*W(2:1+1)+(1-p)*W(1:i))); 
end 


disp(’Option value is’), disp(W) 
Fig. 18.6. Program of Chapter 18: ch18.m. 


gives the asset values s, si, say S; at the ith time level. In this way, the iteration (18.7) is enscap- 
sulated as 


W = max(max(E-Si,0),exp(-r*dt)%»(p*W(2:i+1)+(1-p)*W(1:i))); 


As with ch16, the loops exits with a scalar value for W that gives the option value ve , 
The option value output by ch18.m is 1.0158. The validity of the result will be confirmed by 
ch24 in Chapter 24. 


PROGRAMMING EXERCISES 
P18.1. Alter ch18 in order to re-create Figure 18.4. 


P18.2. Think up an allowable exercise strategy and test it in the manner of 
Figure 18.5. 


Quotes 


Although simulation is a powerful tool 
for solving some higher-dimensional problems, 
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conventional wisdom was that 
simulation could not be applied to American-style pricing problems. 
The algorithms described here represent the first attempts to solve these problems 
that were long thought to be computationally intractable. 
PHELIM BOYLE, MARK BROADIE AND PAUL GLASSERMAN (Boyle et al., 1997) 


Academia was teeming with nerdy mathematicians who had been publishing 
unintelligible dissertations on markets for years. 
Wall Street had started to hire them, but only for research, 
where they’d be out of harm’s way. 
On Wall Street, the eggheads were stigmatized as ‘quants’, 
unfit for the man’s game of trading. 
ROGER LOWENSTEIN (Lowenstein, 2001) 


I prefer the judgement of a 55-year old trader 
to that of a 25-year old mathematician. 
ALAN GREENSPAN, source (Taleb, 1997) 
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Exotic options 


OUTLINE 


e European-style options 

e path-dependent options: lookbacks, barriers and Asians 
e early exercise options: Bermudans and shouts 

e Monte Carlo and binomial methods 


19.1 Motivation 


So far, we have seen European options and American-style options. A bewildering 
array of alternatives are also available; these go by the general name of exotic 
options. Each type of option is distinguished by 


(i) the nature of its path dependency — the way in which the payoff depends upon the asset 
path S(t) forO < t < T, and 
(ii) whether early exercise is allowed. 


In many cases, exact expressions for the option value are not available, and hence 
approximations must be computed. This chapter introduces some of the less eso- 
teric exotics and discusses the use of our two computational algorithms: the bino- 
mial and Monte Carlo methods. A third computational approach, numerical solu- 
tion of a Black-Scholes PDE formulation, is covered in Chapters 23 and 24. 


19.2 Barrier options 


Barrier options have a payoff that switches on or off depending on whether the 
asset crosses a pre-defined level. 


e A down-and-out call option has a payoff that is zero if the asset crosses some pre- 
defined barrier B < Sp at some time in [0, T]. If the barrier is not crossed then the 
payoff becomes that of a European call, max(S(T) — E, 0). 
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Asset 


Time 


Fig. 19.1. Two asset paths and a barrier, B. The thicker asset path crosses the 
barrier and hence would give zero payoff in a down-and-out call. The thinner asset 
path fails to cross the barrier and hence would give zero payoff in a down-and-in 
call. 


e A down-and-in call option has a payoff that is zero unless the asset crosses some pre- 
defined barrier B < So at some time in [0, T]. If the barrier is crossed then the payoff 
becomes that of a European call, max($(T) — E, 0). 


One reason for the popularity of barrier options is that, because the payoff op- 
portunities are more limited, they are cheaper to buy than Europeans. Figure 19.1 
illustrates the idea. Here, two asset paths are shown. Both expire above the ex- 
ercise price: S(T) > E. Despite finishing the higher, the thicker of the two paths 
dips lower, crossing the barrier. The thicker path would give a nonzero payoff for a 
down-and-in call, but a zero payoff for a down-and-out call. Conversely, the thin- 
ner path would give a zero payoff for a down-and-in call, but a nonzero payoff for 
a down-and-out call. 

The hedging idea from Chapter 8 remains valid for barrier options. Let CB(S, t) 
denote the value of a down-and-out call option at asset price S and time t. The 
Black-Scholes PDE (8.15) is relevant unless the barrier is crossed, so C3(S, t) 
must satisfy the PDE on the domain 0 < t < T, B < S. If S = B then the option 
becomes worthless, giving 


C®(B,t) =0, for0 <t<T. (19.1) 
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== Down-and-out 
= = European 


Call value 


Fig. 19.2. Time-zero down-and-out call value (19.3) as a function of S. 


Also, at expiry, for S(T) > B we must recover the European value, so 
c®(S,T) =C(S,T), for B < S. (19.2) 


Here, C (S, t) denotes the European value (8.19). In the case B < E it can be 
shown that a solution to the Black-Scholes PDE on the domain 0 < t < T, B < S, 
that satisfies (19.1) and (19.2) is given by 


CB(S,t) = C(S, t) — (S/B)! 72/9? C(B2/S, t); (19.3) 


see Exercise 19.2. 

We note that (19.3) immediately confirms that the down-and-out call is worth 
less than the European call. 

A plot of the time-zero value C B(S,0) in (19.3) for B < E is given in 
Figure 19.2. The European value is also shown. As we would expect, as the initial 
asset price increases, and so the probability of hitting the barrier decreases, the 
down-and-out call value approaches that of the European. 

Given a formula for a down-and-out call, the corresponding down-and-in can 
be found from the relation 


in + out = European, (19.4) 


see Exercise 19.3. 
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Replacing ‘down’ by ‘up’ gives another class of barrier options. 


e Anup-and-out call option has a payoff that is zero if the asset crosses some pre-defined 
barrier B > So at some time in [0, T]. If the barrier is not crossed then the payoff be- 
comes that of a European call, max(S(T) — E, 0). 

e An up-and-in call option has a payoff that is zero unless the asset crosses some pre- 
defined barrier B > Sọ at some time in [0, T ]. If the barrier is crossed then the payoff 
becomes that of a European call, max($(T) — E, 0). 


There are also, of course, put versions of the above calls; just replace the word 
‘call’ by ‘put’ in each case. This gives a total of eight different up/down-and-in/out 
calls/puts. In each case, an analytical formula for the option value can be obtained 
by solving the Black-Scholes PDE with appropriate final time and boundary con- 
ditions. Formulas for each type of barrier option can be found via the references 
in Section 19.7. As an example that we will return to in Section 19.6, we give the 
formula for an up-and-out call: 


B 1+2r /o? 
S (ra — N (ei) — (=) (N (f2) — ee) 


—1+42r/o 
-Ee T-t) (ra - N (e3) — (5) NCG) — ea) . (19.5) 


Here, dı and dp are defined in (8.20) and (8.21) and 


__ log($/B) + (r + + 40°)(T — D 
7 ONT -t 
log(S/B) + (r — 40°)(T — 2 


a ONT -t 
_ log(S/B) - (r = 30°)(T — D 
ONT -t 
log(S/B) — r + 50°)(T — t) 
ae ONT -t , 
log(SE/B*) — (r — 50°)(T — t) 
= ONT -t , 


log(SE/B*) — (r + 50°)(T — t) 
= oJT 1 
Figure 19.3 plots the up-and-out call value (19.5) at time zero, along with the 


corresponding European. The picture illustrates that barrier options can be signif- 
icantly cheaper than Europeans. The up-and-out call has a limited up-side — the 
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=== Up-and-out 
= = European 


Call value Pid 


Fig. 19.3. Time-zero up-and-out call value (19.5) as a function of S. 


payoff cannot exceed B — E, and hence can be bought for much less than the 
European version. 
There are many generalizations of those eight basic barrier options. 


e Double barrier options impose upper and lower bounds on the asset price, and payoff 
may knock in (or out) if either barrier is (or both barriers are) crossed. 

e Partial barrier options have barriers that apply for a limited time interval. 

e Parisian options have barriers that must remain crossed for some pre-specified amount 
of time. 

e More generally, the barrier may be time-dependent and the nature of the option may be 
re-set (e.g. to another barrier option) if a barrier is crossed. 


Although the Black-Scholes analysis remains relevant in all cases, the more com- 
plicated barrier options do not admit analytical expressions for the value. 


19.3 Lookback options 


The payoff for a lookback option depends upon either the maximum or the mini- 
mum value attained by the asset. There are two broad categories, fixed and floating 
strikes. In describing them, we use the notation 


Soe max S(t) and smn :—= min S(t) 
[0,7] [0.7] 
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to denote the extreme asset values. 


e A fixed strike lookback call option has payoff at the expiry date T given by max(S™* — 
E,0). 

e A fixed strike lookback put option has payoff at the expiry date T given by max(E — 
gmin 0). 

e A floating strike lookback call option has payoff at the expiry date T given by S(T) — 
Smin- 

e A floating strike lookback put option has payoff at the expiry date T given by S™* — 
S(T). 


These lookback options are clearly more valuable than the corresponding Euro- 
peans. The fixed strike lookbacks differ from European options in that the final 
asset value S(T) is replaced by the ‘best’ asset price — the maximum in the case 
of a call and the minimum in the case of a put. With a floating strike, the exercise 
(strike) price becomes the extremely favourable minimum asset price for a call and 
maximum asset price for a put. In the floating case it will always be worthwhile to 
exercise, so the word ‘option’ is perhaps inappropriate. 

It is possible to derive Black-Scholes formulas for the four lookback cases 
above, see Section 19.7 for references. There are many extensions of these ideas, 
typically designed to offer some of the lookback desirability at a cheaper price; 
for example by looking back over a limited time period or over a finite number of 
points in time. In many cases, the options may only be valued by computational 
means. 


19.4 Asian options 


Whereas barriers and lookbacks focus on extreme values of the asset, Asian op- 
tions are determined by average case behaviour. 


e An average price Asian call option has payoff at the expiry date T given by 


1 T 
max (> f S(rjdr — E0). 
T Jo 


e An average price Asian put option has payoff at the expiry date T given by 


1 T. 
max (z — zf S(t)dt, 0) : 
T Jo 


Here we are replacing the final asset price S(T ) that would be used in a European 
option by the average asset price over the time period. 


e An average strike Asian call option has payoff at the expiry date T given by 


1 T 
max (ser — FI S(t)dt, 0) 3 
0 
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e An average strike Asian put option has payoff at the expiry date T given by 


1 T 
max Gf S(t)dt — S(T), 0) ; 
T Jo 


Here we are replacing the strike, or exercise, price Æ, that would be used in a 
European option, by the average asset price. 

Other Asian options can be defined, for instance, by replacing the continuous 
average i S(t)dt/T by an arithmetic average 


1 n 
Paa 


n l/n 
(Fise) 
i=l 


over n time points, 0 < ti < t2 <--- < tn < T. (In practice, as the real asset price 
does not change continuously, even the continuous average would have to be ap- 
proximated from discrete market data.) 

The path dependency for Asians is, in a sense, more complicated than that for 
barrier and lookback options. The payoff depends on the range of asset prices, not 
just the extremes. It is possible to accommodate Asian options into the Black- 
Scholes framework, but exact solutions have been found only in certain cases. One 
such case is treated in Exercise 19.6. 


or geometric average 


19.5 Bermudan and shout options 


A Bermudan option differs from the corresponding American option in only one 
respect. While the American option allows the holder to exercise at any time in 
[0, T], the Bermudan option restricts the early exercise facility to a fixed number 
of pre-determined dates. 

As in the American case, there is no general analytical formula for the Bermu- 
dan option value. 

The simplest version of a shout call option allows the holder to ‘shout’ at most 
once to the writer between times 0 and T. The payoff at expiry is given by 


| max(S(T) — E, S(t) — E), if holder shouted at time T, | 


max(S(T) — E, 0), if holder did not shout, (130) 


and we may make the perfectly sensible assumption that a shout will only take 
place if S(t) > E. The effect of shouting is to lock in a payoff of at least S(t) — E; 
the actual payoff will then be the maximum of this value and the European payoff. 
Typically, a shout will take place if the holder feels that the asset price has peaked 
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and is about to plummet. As with Americans and Bermudans, there is no exact 
valuation formula for shouts. 


19.6 Monte Carlo and binomial for exotics 


The Monte Carlo method that we described in Chapter 15 extends easily to handle 
path dependency. The extra step required is to set up a grid of points t; = j At, for 
0 < j < N, where N is a large number and At = T/N. We are given S(O) = So, 
so from (6.9) we can compute an asset price S(t;+1) in terms of S(t;) using the 
formula 


pel 
Sinore A NRN for iid. Z; ~ N(0,1). (19.7) 


(Note that we use the risk neutrality assumption, u = r.) This gives us the asset 
price at a closely spaced set of points in [0, T], so we can compute approxima- 
tions to the max, min or integral, and test for barrier crossings. For example, the 
following algorithm values an up-and-out call option. Here, M is the number of 
asset paths that we sample. 


fori = 1 to M 
for j = 0 to N — 1 
compute an N(0, 1) sample &; 
a Mes 
set Sj+1 = Sje 727 JAt+o VATE}; 
end 
set Spa = maxo<j<N Sj 
if S™* < B set V; =e"? max(Sy — E, 0), otherwise set V; = 0 
end 
set ay — tr’, V; 
set bt, = wo SNV =a 


The result gives an approximate option price ay and an approximate 95% confi- 
dence interval (15.5). 


For Asian options we could use the Riemann sum At yy 


gt S; to approximate 
the integral ie S(t)dt. With an average price Asian put this would give the fol- 


lowing algorithm: 


fori =1 to M 
for j =0 to N — 1 
compute an N(0, 1) sample &; 
set Sj+1 = Sje 37 Ato VATE 
end 
set Smean; = At 4 Sj 
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Table 19.1. Ninety-five per cent confidence intervals for Monte Carlo 
on a European up-and-out call. Black-Scholes value (19.5) is 0.0983 
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At = 107? At = 107? At = 1074 
M = 107 [0.0469, 0.1671] [0.0397, 0.1387] [0.0569, 0.1813] 
M = 10° [0.0961, 0.1347] [0.0756, 0.1104] [0.0726, 0.1046] 
M = 10+ [0.1042, 0.1163] [0.0997, 0.1112] [0.0926, 0.1038] 
M = 10° [0.1097, 0.1136] [0.1000, 0.1036] [0.0981, 0.1071] 


set V; = e "T (E — Smeanj, 0) 
end 
M 
set ay = i eras v: 
set bir = wn XLV — ay)’ 


Computational example We now apply Monte Carlo to the task of valuing 
an up-and-out call with Sọ = 5, E = 6, o = 0.3, r = 0.05 and T = 1, with 
barrier B = 8. The Black-Scholes value (19.5) was found to be 0.0983. Ta- 
ble 19.1 shows the 95% confidence intervals for timesteps Ar of 10-7, 107° 
and 1074 (so N = 10°, 10° and 104), and number of discrete sample paths M 
equal to 102, 10°, 104 and 10°. As the theory predicts, increasing M causes the 
confidence interval to shrink. However, in general the Monte Carlo method is 


over-estimating the option value. In particular, even for the largest sample size, 
M = 10°, the At = 107° and At = 107° confidence intervals do not contain 
the Black-Scholes value. To understand why, recall that the method is sampling 
the path at finitely many discrete points, rather than over the continuous inter- 
val [0, T]. Because the discrete test maxo<j<n Sj < B is less stringent than 
the continuous test maxo<;<7 S(t) < B, the Monte Carlo method allows more 
nonzero payoffs than it should. As At is refined (so N increases) the discrete 
test approaches the continuous one, and the bias becomes less pronounced. In 
Table 19.1, we see that for At = 1074 and M = 105, the confidence interval 
does contain the Black-Scholes value, although it is still skewed to the right. A 
more expensive simulation with At = 1075 and M = 10° improved the confi- 
dence interval to [0.0980, 0.0992]. © 


Although the Monte Carlo method typically produces low-accuracy solutions, 
it does have the benefit of flexibility. It should be clear that the pathwise sampling 
approach can be applied to any of the generalized path-dependent options men- 
tioned in Sections 19.2, 19.3, 19.4. 
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The binomial method does not naturally extend to path-dependent options, as 
the basic recombining tree of asset prices in Figure 16.1 loses track of individual 
asset paths. At time ty = T we have only a set of asset prices {SM pe 9 and no 
information about how those asset prices were reached. (In fact we are essentially 
averaging over all paths that finished at that price). Even so, researchers have de- 
veloped techniques for adapting the binomial method to barriers, lookbacks and 
Asians; see Section 19.7 for references. 

Conversely, as we have seen in Chapter 18, early exercise does not fit comfort- 
ably with Monte Carlo, but is easily incorporated into the binomial method. 

In the case of Bermudan options, it is clear that the binomial method may be 
used. In fact, as applied to American options in Section 18.4, the method is really 
approximating the American by a Bermudan with a large number of closely spaced 
early exercise points. Bermudan options can thus be handled this way if we simply 
make sure that the prescribed exercise dates are included in the set of times ¢;, and 
then use (18.7) if t; is an allowable exercise time and (16.3) otherwise. 

To handle the shout option with payoff (19.6), note that if a shout happened at 
time Tt, then the payoff may be written 


max (S(T) — S(t), 0) + S(t) — E. (19.8) 


From this point of view, a shout locks in a bonus of S(t) — E and moves the 
exercise price to S(t). Once t and S(t) are known, the first term in (19.8), 
max(S(T) — S(t), 0), corresponds to the payoff for a European option, so it is 
given by the Black-Scholes formula (8.19) with time set to t and exercise price 
set to S(t). We may thus use the approach outlined in Section 18.4 with (18.7) 
replaced by 


V; = max| value (19.8) from shouting at (¢;, SÌ), 
e" (pvit +a- pvt), (19.9) 


for0 <n <i andO <i < M — 1. The overall method is then defined by (16.1), 
(16.2) and (19.9). 


19.7 Notes and references 


The texts (Kwok, 1998) and (Wilmott et al., 1995), and any of the Wilmott 
incarnations, such as (Wilmott, 1998), give much more detail about how the 
Black-Scholes PDE framework can be used to value exotic options. Also (Hull, 
2000; Kwok, 1998; Wilmott, 1998) are good sources for analytical valuation for- 
mulas. 
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Chapter 13 of (Björk, 1998) deals with barriers and lookbacks from a 
martingale/risk-neutral perspective. 

The use of the binomial method for barriers, lookbacks and Asians is discussed 
in (Hull, 2000; Kwok, 1998). 

There are many ways in which the features discussed in this chapter have been 
extended and combined to produce ever more exotic varieties. In particular, early 
exercise can be built into almost any option. Examples can be found in (Hull, 
2000; Kwok, 1998; Taleb, 1997; Wilmott, 1998). 

Practical issues in the use of the Monte Carlo and binomial methods for exotic 
options are treated in (Clewlow and Strickland, 1998). 

From a trader’s perspective, “how to hedge’ is more important than ‘how to 
value’. The hedging issue is covered in (Taleb, 1997; Wilmott, 1998). 


EXERCISES 
19.1. xxx» Suppose that the function V (S, ft) satisfies the Black-Scholes PDE 


(8.15). Let 
po x [X 
VCS, t) = S} av ($) 
Show that 
ƏV i 909 02V a > a-z fav (X 
S S rV =s TR is | ag 
je a a \S 


jaf) OV FX Xav (X Rar 
+o (5) a (ž.:) roy ($) -rV (5-1) l 

Deduce that V(S , t) solves the Black-Scholes PDE. 

19.2. xx Using Exercise 19.1, deduce that C B in (19.3) satisfies the Black- 
Scholes PDE (8.15). Confirm also that C® satisfies the conditions (19.1) 
and (19.2) when B < E. 

19.3. x» Explain why (19.4) holds for all ‘down’ and ‘up’ barrier options. 

19.4. x Why does it not make sense to have B < E in an up-and-in call option? 

19.5. xx The value of an up-and-out call option should approach zero as S ap- 
proaches the barrier B from below. Verify that setting S = B in (19.5) re- 
turns the value zero. 

19.6. xxx» Consider the geometric average price Asian call option, with payoff 


A 1/n 
max (frs) —E,0]|, 
i=l 
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where the points {7;}”_, are equally spaced with 4; =i At and nAt =T. 
By writing 


n 2 3 
[[s@ _ Str) ($) (=) 
i=1 SGi-1) \SCtn-2)/ \SCtn-3) 


S(h) S(t) So 9 


and using the ‘additive mean and variance’ property of independent normal 


random variables mentioned as item (iii) at the end of Section 3.5, show 
that for the asset model (6.9) under risk neutrality, we have 


n 1/n 
log (I sw) js = Nfr = 142) a o2 (n+ ven +1) r); 


(Note in particular that this establishes a lognormality structure, akin to that 


of the underlying asset.) Valuing the option as the risk-neutral discounted 
expected payoff, deduce that the time-zero option value is equivalent to 
the discounted expected payoff for a European call option whose asset has 
volatility & satisfying 


a2 a(n +1)(2n + 1) 
o = o 
6n? 


and drift 7 given by 


~_ 142 | 2, (n + 1) 
= 50° + (r — 50°) ope 


Use Exercise 12.4 and the Black—Scholes formula (8.19) to deduce that the 
time-zero geometric average price Asian call option value can be written 


ee (Soe? VG) 5 EN@)) (19.10) 


where 


~ — log(So/E) + (A + 567)T 

dı = P ’ 
ENT 

d = dı —6VT. 


19.8 Program of Chapter 19 and walkthrough 199 


19.7. x» Write down a pseudo-code algorithm for Monte Carlo applied to a float- 
ing strike lookback put option. 


19.8 Program of Chapter 19 and walkthrough 


In ch19, listed in Figure 19.4, we value an up-and-out call option. The first part of the code is a 
straightforward evaluation of the Black-Scholes formula (19.5). The second part shows how a Monte 
Carlo approach can be used. This code follows closely the algorithm outlined in Section 19.6, except 
that the asset path computation is vectorized: rather than loop for j = 0 : N-1, we compute the 
full path in one fell swoop, using the cumprod function that we encountered in ch07. 

Running ch19 gives bsval = 0.1857 for the Black-Scholes value and conf = [0.1763, 
0.1937] for the Monte Carlo confidence interval. 


PROGRAMMING EXERCISES 


P19.1. Use a Monte Carlo approach to value a floating strike lookback put option. 


P19.2. Implement the binomial method for a shout option, using (19.9), and in- 
vestigate its rate of convergence. 


Quotes 


There are so many of them, and some of them are so esoteric, 
that the risks involved may not be properly understood 
even by the most sophisticated of investors. 
Some of these instruments appear to be specifically designed to enable institutions 
to take gambles which they would otherwise not be permitted to take ... 
One of the driving forces behind the development of derivatives 
was to escape regulations. 
GEORGE SOROS, source (Bass, 1999) 


The standard theory of contingent claim pricing through dynamic replication 
gives no special role to options. 
Using Monte Carlo simulation, path-dependent multivariate claims of great complexity 
can be priced as easily as the path-independent univariate hockey-stick payoffs which 
characterize options. 
It is thus not at all obvious why markets have organized to offer these simple payoffs, 
when other collections of functions 
such as polynomials, circular functions, or wavelets 
might offer greater advantages. 
PETER CARR, KEITH LEWIS AND DILIP MADAN, ‘On The Nature of Options’, 
Robert H. Smith School of Business, Smith Papers Online, 2001, source 
http://bmgt1 -notes.umd.edu/faculty/km/papers.nsf 


Do you believe that huge losses on derivatives are confined to reckless or dim-witted 
institutions? 
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%CH19 Program for Chapter 19 

% 

% Up-and-out call option 

% Evaluates Black-Scholes formula and also uses Monte Carlo 


randn(‘state’,100) 


APA” PPA” Problem and method parameters %%%%%%%%M%%% 
S = 5; E = 6; sigma = 0.25; r = 0.05; T = 1; B = 9; 

Dt = 1e-3; N = T/Dt; M = 1e4; 

Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo YoYo o o Po A AP P P P% 


APA% APAP” Black-Scholes value V%VLVVMVWVWWLVVVV% 
tau = T; 

power1 = -1 + (2*r)/(sigma‘2); 

power2 = 1 + (2*r)/(sigma‘2); 

d1 = (log(S/E) + (r + 0.5*sigma^2)*(tau))/(sigma*sqrt(tau)); 

d2 = d1 - sigma¥*sqrt(tau); 

e1 = (log(S/B) + (r + 0.5*sigma^2)*(tau))/(sigma¥*sqrt(tau)); 

e2 = (log(S/B) + (r - 0.5*sigma^2)*(tau))/(sigma*sqrt(tau)); 

f1 = (log(S/B) - (r - 0.5*sigma’2)*(tau))/(sigma*sqrt(tau)); 

f2 = (log(S/B) - (r + 0.5*sigma^2)*(tau))/(sigma¥*sqrt(tau)); 

g1 = (log(S*E/(B^2)) - (r - 0.5*sigma^2)*(tau))/(sigma*sqrt(tau)); 

g2 = (log(S*E/(B^2)) - (r + 0.5*sigma’2)*(tau))/(sigma*sqrt(tau)); 

Nd1 = 0.5*(1+erf(d1/sqrt(2))); Nd2 = 0.5*(1+erf(d2/sqrt(2))); 

Ne1 = 0.5*(1+erf(e1/sqrt(2))); Ne2 = 0.5*(1+erf(e2/sqrt(2))); 

Nf1 = 0.5*(1+erf(f1/sqrt(2))); Nf2 = 0.5*(1+erf(f2/sqrt(2))); 

Ng1 = 0.5*(1+erf(g1/sqrt(2))); Ng2 = 0.5*(1+erf(g2/sqrt(2))); 

a = (B/S) power1; b = (B/S) power2; 

bsval = S*(Nd1-Ne1-b*(Nf2-Ng2)) - E*exp(-r*tau)*(Nd2-Ne2-a*(Nf1-Ng1)) 
Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo 


V = zeros(M,1); 

fori = 1:M 
Svals = S*cumprod(exp((r-0.5*sigma’2)*Dt+sigma*sqrt(Dt)*randn(N,1))); 
Smax = max(Svals); 


if Smax < B 
V(i) = exp(-r*T)*max(Svals(end)-E,0); 
end 
end 


aM = mean(V); bM = std(V); 
conf = [aM - 1.96*bM/sqrt(M), aM + 1.96*bM/sqrt(M)] 


Fig. 19.4. Program of Chapter 19: ch19.m. 
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If so, consider: 
Procter & Gamble (lost $102 million in 1994) 
Gibson Greetings (lost $23 million in 1994) 
Orange County, California (bankrupted after $1.7 billion loss in 1994) 
Baring’s Bank (bankrupted after $1.3 billion loss in 1995) 
Sumitomo (lost $1.3 billion in 1996) 
Government of Belgium ($1.2 billion loss in 1997) 
National Westminster Bank (lost $143 million in 1997) 
PHILIP MCBRIDE JOHNSON (Johnson, 1999) 
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Historical volatility 


OUTLINE 


e Monte Carlo type estimates 
e maximum likelihood estimates 
e exponentially weighted moving averages 


20.1 Motivation 


We know that the volatility parameter, o, in the Black-Scholes formula cannot be 
observed directly. In Chapter 14 we saw how o for a particular asset can be esti- 
mated as the implied volatility, based on a reported option value. In this chapter 
we discuss another widely used approach — estimating the volatility from the pre- 
vious behaviour of the asset. This technique is independent of the option valuation 
problem. Here is the basic principle. 


Given that we have (a) a model for the behaviour of the asset price that involves o and (b) 
access to asset prices for all times up to the present, let us fit o in the model to the observed 
data. 


A value o* arising from this general procedure is called a historical volatility 
estimate. 


20.2 Monte Carlo type estimates 


We suppose that historical asset price data is available at equally spaced time val- 


ues f; := i At, so S(t;) is the asset price at time t;. We then define the log ratios 
S(t) 
U; :=1o : (20.1) 
BI eS) 


Our asset price model (6.9) assumes that the {U;} are independent, normal ran- 
dom variables with mean (u — 50°) At and variance o” At. From this point of 
view, getting hold of historical asset price data and forming the log ratios is 
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equivalent to sampling from an N((u — 50°) At, o? At) distribution. Hence, we 
could use a Monte Carlo approach to estimate the mean and variance. Sup- 
pose that t = t, is the current time and that the M + 1 most current asset prices 
{S(tr-m), S(th-m 41), ---, S(tn—1), S(tn)} are available. Using the corresponding 


log ratio data, {Uny} the sample mean (15.1) and variance estimate (15.2) 
become 
1 
au =a 3 Uiii (20.2) 
2 1 m 2 
bu =a 2 Unii -aņ?’. (20.3) 


We may therefore estimate the unknown parameter o by comparing the sample 
mean am With the exact mean (u — 50°)At from the model, or by comparing the 
sample variance Bi, with the exact variance o7 Ar from the model. In practice the 
latter works much better — see Exercise 20.1 — and hence we let 


Pere (20.4) 


Vat 


Exercise 20.2 shows that this can be written directly in terms of the U; values as 


M 


2 
1 1 1 
fe ES Gi pee SPs We. “G05 
= |e (ga hth ma (Leen) |. ens 


20.3 Accuracy of the sample variance estimate 


To get some idea of the accuracy of the estimate o* in (20.4) we take the view 
that we are essentially using Monte Carlo simulation to compute be as an approx- 
imation to the expected value of the random variable (U — E(U ))?, where U ~ 
N((u — 50°) At, o? At). (This is not exactly the case, as we are using an approxi- 
mation to E(U ).) Equivalently, after dividing through by Ar, we are using Monte 
Carlo simulation to compute ov = b+, / At as an approximation to the expected 
value of the random variable (T — E(U))’, where U ~ N((u — 50°) VAt, o’). 
Hence, from (15.5), an approximate 95% confidence interval for ø? is given by 


1.96v 

JM ` 
where v is the variance of the random variable (U — E(U yy’. Exercise 20.3 shows 
that 


o? 


v = 20%. (20.6) 
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So the approximate confidence interval for o? has the form 


1.96./202 
o* + 1.96/20? 


20.7 
Ti S 


It may then be argued that 


1.960* 
V2M 
is an approximate 95% confidence interval for ø, see Exercise 20.4. In particular, 


we recover the usual 1/ M behaviour. 
There is, however, a subtle point to be made. In a typical Monte Carlo simula- 


o*+ 


(20.8) 


tion, taking more samples (increasing M) means making more calls to a pseudo- 
random number generator. In the above context, though, taking more samples 
means looking up more data. There are two natural ways to do this. 


(1) Keep At fixed and simply go back further in time. 
(2) Fix the time interval, M At, over which the data is sampled and decrease At. 


Both approaches are far from perfect. Case (1) runs counter to the intuitive notion 
that recent data is more important than old data. (The asset price yesterday is more 
relevant than the asset price last year.) We will return to this issue later. Case (2) 
suffers from a practical limitation: the bid—ask spread introduces a noisy compo- 
nent into the asset price data that becomes significant when very small At values 
are measured. Overall, finding a compromise between large M and small At is a 
difficult task. 

If o* is computed in order to value an option, then a widely quoted rule of 
thumb is to make the historical data time-frame M At equal to that of the option: 
to value an option that expires in six months’ time, take six months of histori- 
cal data. There is also some evidence that taking longer historical data periods is 
worthwhile. 

Using the identity log(a/b) = loga — log b to simplify (20.2) we find that 


1 
aM = 2 (log S (tn+1—i) — log S (tn—i)) 


1 
S (log S (tn) — log S (tn-m)) 
l og SE 


ZM S SG 


Because those intermediate terms cancel, ay depends only on the first and last 
S values! Our asset price model assumes that log(S (tn)/S(tn-m)) is normal with 
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mean (u — 50°)M At and variance 07M At. Hence, 
At 
am ~N (u — łøo’)Ar, oo) (20.9) 


In practice, because ay is normal with small mean and variance, it is common to 
replace it by zero in (20.3), which leads to 


M 


1 1l 
ES r eh 20.10 
; At (M — 1) & "tii re 


instead of (20.5). This alternative has been found to be more reliable in general. 


20.4 Maximum likelihood estimate 


To justify further the historical volatility estimate (20.10), we will show that an 
almost identical quantity 


ors 222502. (20.11) 


can be derived from a maximum likelihood viewpoint. Note that (20.11) differs 
from (20.10) only in that M — 1 has become M. 
The maximum likelihood principle is based on the following idea: 


In the absence of any extra information, assume the event that we observed was the one 
that was most likely to happen. 

In terms of fitting an unknown parameter, the idea becomes: 

Choose the parameter value that makes the event that we observed have the maximum 
probability. 

As a simple example, consider the case where a coin is flipped four times. 
Suppose we think the coin is potentially biased — there is some p € [0, 1] such 
that, independently on each flip, the probability of heads (H) is p and the proba- 
bility of tails (T) is 1 — p. Suppose the four flips produce H,T,T,H. Then, under 
our assumption, the probability of this outcome is p x (1 — p) x (1— p) x p= 
p*(1 — p)*. Simple calculus shows that maximizing p?(1 — p)? over p € [0, 1] 
leads to p = F, which is, of course, intuitively reasonable for that data. Similarly, 
if we observed H,T,H,H, the resulting probability is pa — p). In this case, max- 
imizing over p € [0, 1] gives p = 3, also agreeing with our intuition. 

That simple example involved a sequence of independent observations, where 
each observation (the result of a coin flip) is a discrete random variable. In the 
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case where the model involves outcomes, say U1, U2,..., Uy, from a continuous 
random variable with density f(x) that involves some parameter, we look for the 
parameter value that maximizes the product 


fU) f U2)... f (Um). 


Formally, this maximizes the value of the corresponding probability density func- 
tion at the point (U1, U2, ..., Um). 

Returning to the case of estimating the value o from our observations of U; 
in (20.1), we first make a simplification. On the basis that U; /VAt ~N((u - 
507) VAt, a”), we take the view that the mean of Ui/ ~v At is negligible, and re- 
gard it as zero. The corresponding density function for each scaled observation 
Ui/ VAt becomes 


F(x) = eld, 
2mo? 


Our maximun likelihood estimate of ø is then found by maximizing 


M 
1 A ~ 

Il - e-Üimi/ C, where Onyi-i = UY, (20.12) 

= TO 


I= 


Exercise 20.5 shows that this leads to the estimate (20.11). 


20.5 Other volatility estimates 


Under the simplifying assumption that U; has zero mean, var(U;) = E(U o For 
the estimate (20.11) we have 


and hence we may interpret Ato*? asa sample mean approximation for this ex- 
pected value. Keep in mind that the samples, U A correspond to different points 
in time. It has been found that rather than treating each observation U; equally it 
is more appropriate to give extra weight to the most recent values. This leads to 
schemes of the general form 


M M 
Ato = X aUai where Soa =], (20.13) 
i=l i=l 


208 Historical volatility 


with a} > a2 >--->ay > 0. It is common to use geometrically declining 
weights: a+; = waj, for some 0 < w < 1. This produces the estimate 


The choice w = 0.94 is popular. Note that (0.94)!° ~ 0.54, (0.94)! ~ 0.0021 
and (0.94)°9° < 1075, so in this case, even if M is chosen to be very large, samples 
more than around a hundred Af units old are essentially ignored. 

If a new volatility estimate is needed at each time ty, there is a neat variation of 
this idea. Suppose Ato** is our estimate of Ato” computed at time t,, based on 
{U, ia: ,- Then an estimate for time ¢,41 can be computed as 


Atop,” = wAtos? + (1 — w)U gy). (20.14) 


This process is close to having geometrically declining weights, see Exercise 20.7, 
and has the advantage that updating from time ft, to time t,+1 does not require the 
old data U;, fori < n, to be accessed. 

Formulas such as (20.14) are sometimes referred to as exponentially weighted 
moving average (EWMA) models. Of course, the notion of computing a time- 
varying estimate of the volatility is inherently at odds with the underlying assump- 
tion of constant volatility that is used in the derivation of the Black-Scholes for- 
mula. Even so, it has been observed empirically that asset price volatility is not 
constant, and techniques that account for this fact have proved successful. 


20.6 Example with real data 


In Figure 20.1 we estimate historical volatility for the IBM daily and weekly data 
from Figures 5.1 and 5.2. In both cases, we assume that the data corresponds to 
equally spaced points in time. The daily data runs over 9 months (T = 3/4 years) 
and has 183 asset prices (M = 182), so we set At = T/M ~ 0.0041. The weekly 
data runs over 4 years (T = 4) and has 209 asset prices (M = 208), so we set 
At =T/M ~ 0.0192. 

For the daily data we found ay = —4.3 x 1074, confirming that it is reason- 
able to regard the log ratio mean as zero. The Monte Carlo based estimate (20.4) 
produced o* = 0.4069 with a 95% confidence interval of [0.3653, 0.4486]. Given 
that ay ~ O, it is not surprising that the simpler estimate (20.10) produced an 
almost identical value o* = 0.4070. This o* is represented as a dashed line in 
the upper picture. The EWMA is plotted as diamond shaped markers joined by 
straight lines. Here, we used the first 20 U; values to compute a Monte Carlo based 
estimate, and inserted this as a starting value for o* in the update formula (20.14). 
Our weight was w = 0.94. 
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Fig. 20.1. Historical volatility estimates for IBM data from Figures 5.1 and 
5.2. Upper picture: daily. Lower picture: weekly. Diamonds are exponentially 
weighted moving averages. Dashed lines show the estimate (20.10). 


The lower picture repeats the exercise for the weekly data. In this case (20.4) 
produced o* = 0.3610 with a 95% confidence interval of [0.3263, 0.3957]. We 
found that ay = —4.0 x 107° and the estimate (20.10) gave o* = 0.3621. 

Overall, the small size of the sample mean ay and the reasonable agreement 
between the daily and weekly o* estimates are encouraging. However, the large 
confidence intervals for these estimates, and the significant time dependency of 
the EWMA, are far from reassuring. Generally, extracting historical volatility es- 
timates from real data is a mixture of art and science. 


20.7 Notes and references 


Volatility estimation is undoubtedly one of the most important aspects of practical 
option valuation, and it remains an active research topic, see (Poon and Granger, 
2003), for example. 

More sophisticated time-varying volatility models, including autoregressive 
conditional heteroscedasticity (ARCH) and generalized autoregressive conditional 
heteroscedasticity (GARCH) are discussed in (Hull, 2000), for example. 

In addition to providing information for option valuation, historical volatility 
estimates are a key component in the determination of Value at Risk; see (Hull, 
2000, Chapter 4), for example. 


210 Historical volatility 


EXERCISES 


20.1. xx» Consider a Monte Carlo approach where the sample mean ay in 
(20.2) is used to approximate the exact mean (u — 50°)At in order to es- 
timate o. Suppose that a fixed time-frame M At is used for the log ratios. 
This corresponds to case (2) in Section 20.3. Show that the 95% confidence 
interval for the mean has width proportional to 1/M. Convince yourself that 
this is a poor method. [Hint: use (20.9) and refer to Chapter 15.] 

20.2. xx Establish (20.5). 

20.3. xx Let Z~N(O,1) and Y=a+8Z, for œ,ß eR. Show that 
var ((Y — E(Y))?) = 26%. Hence, verify (20.6). 

20.4. xx Use the expansion af Ve Le Je for small € > 0, to show how the 
approximate confidence interval (20.8) may be inferred from (20.7). 

20.5. xxx Show that maximizing (20.12) with respect to ø leads to the estimate 
(20.11). [Hints: (1) take logs — maximizing a positive quantity is equivalent 
to maximizing its log, (2) regard ø? as the unknown parameter, rather than 
o.] 

20.6. * Give a convincing argument for the constraint ee a; = 1 in (20.13). 

20.7. xx Explain why (20.14) almost corresponds to having geometrically declin- 
ing weights. 


20.8 Program of Chapter 20 and walkthrough 


In ch20, listed in Figure 20.2, we look at historical volatility estimation with artificial data, cre- 
ated with a random number generator. The array U has ith entry given by the log of the ratio 
asset (it+1)/asset (i), where the asset path, asset, is created in the usual way, using a volatility 
of sigma = 0.3. The Monte Carlo volatility estimate (20.4) turns out to be 0.2947, with an ap- 
proximate confidence interval (20.8) of cont = [0.2855, 0.3038]. The simplified estimate with 
sample mean set to zero also gives sigma2 = 0.2947. We then apply the EWMA formula (20.14) 
with w = 0.94, using a Monte Carlo estimate of the first twenty U values to initialize the volatility. 
The running estimate, s, is plotted and the exact level 0.3 is superimposed as a dashed white line. 
Figure 20.3 shows the picture. The final time EWMA volatility value was sigma3 = 0.2588. 

In this example, the Monte Carlo version performs better than EWMA. This is to be expected — 
we are generating paths that agree with our underlying model (6.9), so taking as many old data points 
as possible is clearly a good idea. The EWMA approach of giving extra weight to more recent data 
points is designed to improve the estimate when real stock market data is used. 


PROGRAMMING EXERCISES 


P20.1. Apply the techniques in ch20 to some real option data. 


P20.2. Compare implied and historical volatility estimates on some real option 
data. 


20.8 Program of Chapter 20 and walkthrough 


%CH20 Program for Chapter 20 
% 
% Computes historical volatility from artificially generated data 


clf 
randn(‘state’,100) 


APAPAP” APPA Parameters VPYVVMWMWWWVVVY% 
sigma = 0.3; r = 0.03; M = 2e3; Dt = 1/(M+1); 
Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo oP P P AP Yo 


asset = cumprod(exp((r-0.5*sigma’2)* Dt+sigma* sqrt(Dt)* randn(M+1,1))); 
U = log(asset(2:end)./asset(1:end-1)); 


%% Monte Carlo estimate based on all data %% 

Umean = mean(V); 

Ustd = std(U); 

sigma1 = Ustd/sqrt(Dt) 

cont = [sigma1*(1-1.96/sqrt(2*M)), sigma1*(1+1.96/sqrt(2*M))] 


%% Simplified estimate (assumes zero mean) 
sigma2 = sqrt(sum(U.°2)/((M-1)*Dt)) 


%% Running EWMA %% 

%% First get a starting value %% 
s = zeros(M,1); 

L=20; 

V = U(1:L); 

s(L) = std(V)/sqrt(Dt); 

%% Now do EWMA %% 


w = 0.94; 
for n = L:M-1 

s(n+1) = sqrt((w¥*Dt*s(n)"2 + (1-w)*U(n+1)2)/Dt); 
end 


sigma3 = s(end) 


plot([L:M], s(L:end),’r-d’) 

hold on 

plot([1 M],[sigma sigma],’w-',’LineWidth’,2) 
xlabel('t’), ylabel(’Volatility’), ylim([0, 0.5]), grid on 


Fig. 20.2. Program of Chapter 20: ch20.m. 
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Fig. 20.3. Figure produced by ch20. 


Quotes 


There are two main approaches to estimating volatility and correlation: 
a direct approach using historical data 
and an indirect approach of inferring volatility from option prices. 
The historical approach has the virtue of working directly 
with the most relevant data but is always handicapped by “looking backward’. 
Implied volatility is a naturally forward-looking measure, 
but it is difficult to separate estimation error from model error. 
For example, differing Black-Scholes implied volatilities 
could be due to non-constant volatility 
or could be due to violations of perfect market assumptions 
that have unequal impacts on different options 
(e.g., differences in liquidity and transactions costs among options). 
MARK BROADIE AND PAUL GLASSERMAN (Broadie and Glasserman, 1998) 


A headline in Enron’s 2000 annual report states 
‘In Volatile Markets, Everything Changes But Us.’ 
Sadly, Enron got it wrong. 
Testimony of FRANK PARTNOY, Professor of Law, University of San Diego School of 
Law, hearings before the United States Senate Committee on Governmental Affairs, 24 
January 2002. Taken from Financial Engineering News, June/July 2002, Issue No. 26. 


Since the statistical properties of the sample mean 
make it a very inaccurate estimate of the true mean, 
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taking deviations around zero rather than around the sample mean 
typically increases forecast accuracy. 
T. CLIFTON GREEN AND STEPHEN FIGLEWSKI (Green and Figlewski, 1999) 


The authors emphasize that, 
as even the most cursory examination of the historical record reveals, 
“geometric Brownian motion’ is at best a first approximation to the actual 
movements of the price of any real stock or collection of stocks. 
Even their assumption that the governing processes 
are stochastic — rather than examples of deterministic chaos — may in time 
be disproved by sufficiently sensitive measurement techniques. 
JAMES CASE, reviewing the book (Mantegna and Stanley, 2000) in Society for 
Industrial and Applied Mathematics (SIAM) News, Volume 34, January/February 2001. 


Long run is a misleading guide to current affairs. 
In the long run we are all dead. 
JOHN MAYNARD KEYNES (1883-1946), A Treatise on Monetary Reform, 
Chapter 3, 1923. 
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Monte Carlo Part II: variance reduction by 
antithetic variates 


OUTLINE 


e covariance 

e antithetic variates for uniform samples 
e antithetic variates for normal samples 
e barrier option example 


21.1 Motivation 


The Monte Carlo method gives a simple and flexible technique for option valu- 
ation. However, we have seen that it can be expensive. This chapter and the next 
cover two approaches that attempt to improve efficiency. The antithetic variates 
idea in this chapter has the benefit of being widely applicable and easy to imple- 
ment. In order to understand how the idea works, we need to discuss the concept 
of covariance between random variables. 


21.2 The big picture 


The Monte Carlo method uses the sample mean (15.1) to approximate the expected 
value of the random variable X, where the X; are i.i.d. with E(X;) = E(X). We 
saw in Chapter 15 that the width of the corresponding confidence interval is in- 
versely proportional to VM. This makes it an expensive business to improve the 
approximation by taking more samples. To get an extra digit of accuracy, that is, to 
shrink a confidence interval by a factor of 10, requires 100 times as many samples. 
However, we also saw that the confidence interval width scales with ./var(X;). 
This motivates the idea of replacing the X; in (15.1) with another sequence of i.i.d. 
random variables that have the same mean as the X; but with smaller variance. 
This is the idea behind variance reduction. One way to summarize the potential 
advantage is: 
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If we can reduce the variance in X; by a factor R < 1, then for a given number of samples, 
M, the new version has confidence intervals that are a factor ~R smaller. So for R = 107% 
the new version gives roughly k/2 extra digits of accuracy. 


Under the assumption that sampling from the new random variable sequence costs 
about the same as sampling from X;, we could re-state this from a slightly different 
viewpoint: 


If we can reduce the variance in X; by a factor R < 1, then the new method gives confi- 
dence intervals of the same width for R times less work. 


21.3 Dependence 


So far, we have focused on collections of independent random variables. In par- 
ticular, we have repeatedly used the result (3.9): if X and Y are independent then 
E(XY) = E(X)E(Y). To discuss variance reduction techniques for Monte Carlo, 


we now need to consider the case where random variables are not independent. 

Intuitively, two random variables are independent if knowing the value of one 
does not give any information about the value of the other. For illustration, sup- 
pose we flip two coins. Denote the four possible outcomes by {H1, H2}, {H1, T2}, 
{T,, H2}, {T1, T2}, where, for example, {H1, T2} signifies heads for the first coin 
and tails for the second. Now define random variables X and Y as follows. Let X 
take the value 1 if the first coin lands heads and 0 otherwise, and let Y take the 
value 1 if the first and second coins land heads and 0 otherwise. Thus 


1, for {H), H2}, 1, for {H, H2}, 

1, for {H, T2}, 0, for {Hj, T2}, 
X= d Y= 

0, for{T),Hs}, 7 0, for {T1, H2}, 

0, for {T], Tz}, 0, for {T], To}. 


It is intuitively clear that X and Y are not independent — for example, know- 
ing that X = 0 allows us to deduce immediately that Y = 0. To work out the 
expected values E(X), E(Y) and E(XY), we need the following information. 


Probability X Y XY 
{H1, H2} i 1 1 1 
{H1, T2} I 1 0 0 
{T,, H2} l 0 0 0 
{T1, T2} n 0 0 0 
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So applying the expected value formula (3.1), E(X) = Z, (Y) = 1 and E(XY) = 
L, Thus we have E(XY) 4 E(X)E(Y ), confirming that X and Y cannot be inde- 
pendent. 

As a measure of ‘dependence’ between the random variables X and Y, the co- 
variance, COV(X, Y ), is defined as follows, 


cov(x, Y) := E[(X — E(X) Y — E(Y))]. (21.1) 


Equivalently, we may write 


cov(x, Y) := E(XY) — E(X)E(Y), (21.2) 


see Exercise 21.1, and it follows immediately that if X and Y are independent then 
cov(X, Y) = 0. Loosely, from (21.1), if the covariance is positive then X and Y 
tend to be smaller than their means or larger than their means at the same time. 
Similarly, if the covariance is negative then X tends to be above its mean when Y 
is below its mean, and vice versa. In the example above we have cov(X, Y) = t 
which supports this interpretation. 


21.4 Antithetic variates: uniform example 


To illustrate the use of antithetic variates, suppose we apply Monte Carlo to ap- 
proximate the expected value 


1=Eev¥), where U ~ U(0, 1). (21.3) 


For reference, we note that Z = 2, see Exercise 21.2. A Monte Carlo estimate of J 
is 


1 M 
=F XOY;, where Y; = eV withii.d. U; ~ U(0,1). (21.4) 


We know that the accuracy of Monte Carlo is related to the variance of Y;. In this 
case we have 


2 
-7 
var(Y;) = Í a (21.5) 
see Exercise 21.2. 
Now consider using the antithetic variate Monte Carlo estimator 
PA 1 M pa vU V1-U; 
= phir where f; = a with U; ~ U(0, 1). 
j (21.6) 


This antithetic version ‘re-uses’ U; in the form | — U;. Note that 1 — U; ~ 
UO, 1), so E *(¥;) = = E z(e¥U), In terms of random number generation, Iņ and T; M 
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have the same costs — both use M uniform samples. (Note, however, that there is 
some overhead associated with Tu. Twice as many evaluations of the exponential 
and square root functions are required.) 

From the useful identity 


var(X + Y) = var(X) + var(Y) + 2cov(x, Y), (21.7) 
see Exercise 21.3, we have 
JU; J1I-Uj 
var (: e ) =! (var(ev™) + var(ev!-U) + 2cov(ev%, evr) 


= } (var(ev™) + cov(eV, ev-%)) (21.8) 


This is the key expression that tells us how the variance in Y; compares with the 
variance in the original Y;. 
Now 


(VTA f AT dx 
0 


and hence, using (21.2), 


1 
cov(eVUi, ev!-Ui) = | gE ena (21.9) 
0 
Inserting (21.5) and (21.9) in (21.8), we arrive at 
=~ 2 2 15 A. eS 
var(Y;) = ae + >| ev “ty I= dy, (21.10) 
4 4 2 Jo 


Using a numerical quadrature routine to approximate the integral in (21.10), we 
find that 


var(Y;) ~ 0.001 073. 


Hence, 


var(Y;) = 
=" m~ 181.2485. (21.11) 
var(Y;) 


It follows from the discussion in Section 21.2 that the antithetic version gives us at 
least an extra digit of accuracy for the same amount of random number generation. 


Computational example Table 21.1 shows the 95% confidence intervals for [jy 
and the antithetic version Iu for the problem (21.3). We did four tests, covering 
M = 107, 10°, 104, 10°, and used the same random number samples for the two 
methods. In addition to the confidence intervals, we give the ratio of the sizes 
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Table 21.1. Ninety-five per cent confidence intervals for (21.4) and 
(21.6) on problem (21.3), plus ratios of their widths 


M Standard Antithetic Ratio of widths 
102 [1.8841, 2.0752] [1.9875, 2.0012] 14.0 
10° [1.9538, 2.0087] [1.9976, 2.0017] 13.4 
104 [1.9890, 2.0062] [1.9997, 2.0010] 13.5 
105 [1.9969, 2.0023] [1.9998, 2.0002] 13.5 


of the two confidence intervals. This is precisely the ratio of the square roots 
of the sample variances. As predicted by (21.11), it converges to /181.2485 ~ 
13.5. As a practical note, it is worth emphasizing that the confidence intervals 
for the antithetic variates estimate were computed via the sample variance of 


F, which are independent, and not fev" U fev1-Ui a which are 


highly correlated. © 


21.5 Analysis of the uniform case 


To understand how the antithetic variate technique works, consider the more gen- 
eral case of approximating 


I =E(f(U)), where U ~ U(0, 1), 


for some function f. The standard Monte Carlo estimate is 
1 M 
Iu = 35 LFW), with i.i.d. U; ~ U(0, 1), (21.12) 


and the antithetic alternative is 


~ 1% fUD+fA-U) 
ane 5 s 


with i.i.d. U; ~ U(0, 1). (21.13) 


Copying the way that we derived (21.8), we find that 
Ui 1— Ui 
var(£ dt fC i) 


5 ) = į} (var (f (U) + cov (f (Ui), fA — Ui))). 
(21.14) 
The success of the new scheme hinges on whether var(5( f(UD+ 
fa- Ui))) is smaller than var(f(U;)). The identity (21.14) tells us that effi- 
ciency boils down to making cov (f (Ui), f(1 — U;)) as negative as possible. We 
want f(U;) to be big (relative to its mean) when f(1 — U;) is small (relative to 


its mean). Intuitively, this approach will work when f is monotonic. Loosely, the 
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antithetic variate technique attempts to compensate for samples that are above the 
mean by adding samples that are below the mean, and vice versa. 

We may convert this intuition into a mathematical result. First we recall that 
to say a function f is monotonic increasing means xı < x2 > f (x1) < f2). 
Similarly, to say a function f is monotonic decreasing means x, < x2 > f (x1) = 
f (x2). It follows straightforwardly that if f and g are both monotonic increasing 
functions or both monotonic decreasing functions then 


(fF) — fOe) —gQ)) 29,  forany x and y, (21.15) 
see Exercise 21.5. Now we prove a useful lemma. 
Lemma If f and g are both monotonic increasing functions or both monotonic 
decreasing functions then, for any random variable X, 
cov(f (X), g(X)) = 0. 


Proof Let Y be a random variable that is independent of X with the same 


distribution. From (21.15) we may write 


(F(X) -fY EX) — g(V)) = 0. 


So the random variable (f(X) — f(Y)) (g(X) — g(Y)) must have a non- 


negative expected value. Hence 


0 < EIX) — fO) (eX) - 8Y) 
= E[f X8 (X)] -EI X80) -EIS Y) WI +E Y) Y)]. 


Since X and Y are i.i.d., that last right-hand side simplifies to 


2E[f(X)8(X)] — 2E [f X) E [8(X)], 


which is 2 cov(f (X), g(X)), and the result follows. © 


Now note that if f is a monotonic increasing function, then so is — f (1 — x). 
Similarly, if f is a monotonic decreasing function, then so is — f (1 — x). In ei- 
ther case, applying our lemma gives cov(f (X), —f (1 — X)) > 0. Equivalently, 
cov(f (X), f — X)) < 0. In (21.14) this shows that 


var (FO? + f — Ui) 


5 ) < 5var(f(Ui)), (21.16) 
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when f is monotonic. In words: 


For monotonic f, the variance in the antithetic sample is always less than or equal to half 
that in the standard sample. 


Of course, this is only a bound. The actual improvement can be much better, as in 
the f(x) = ev* example of the previous section. 


21.6 Normal case 


The antithetic variates trick is not restricted to functions of uniform random vari- 
ables. In the case of 


I =E(f(U)), where U ~ N(0, 1), (21.17) 


the standard Monte Carlo estimate is 
1 
Iu => LFW), with iid. U; ~ N(, 1), (21.18) 


and the antithetic alternative is 


Ss, ee RO FU rare: 
Iu = 3 : 5 uw with iid. U; ~ N(O, 1). (21.19) 
Because the N(O, 1) distribution is symmetric about the origin, rather than about 


5, the antithetic estimate uses —U;, rather than 1 — U;. Of course, —U; is also an 
NO, 1) random variable. 


The above analysis that gave us (21.16) can then be repeated to give us 


os (eo + f(—Ui) 


5 ) < zvar (f (Ui)) (21.20) 


when f is monotonic. 


Computational example Here we show the antithetic variate trick in use with 
N(O, 1) samples. We take (21.17) with f(x) = (1/./e)e*, so that E(f (U)) = 1 
(see Exercise 15.3). (A similar computation was done in Chapter 15 for standard 


Monte Carlo. We now scale by 1/,/e so that the confidence intervals are easier 
to assimilate.) Table 21.2 shows the 95% confidence intervals for (21.18) and 
(21.19). As in the previous example, we took M = 107, 10°, 104, 10°, and used 
the same random number samples for the two methods. The antithetic version 
gives almost twice as much accuracy. © 
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Table 21.2. Ninety-five per cent confidence intervals for (21.18) and 
(21.19) on problem (21.17) with f (x) = (1/./e)e*, plus ratios of 
their widths 


M Standard Antithetic Ratio of widths 
102 [0.8247, 1.2819] [0.9518, 1.6767] 0.6 
10° (0.9713, 1.1574] [1.0166, 1.1244] 1.7 
104 [0.9647, 1.0137] [0.9945, 1.0243] 1.6 
105 [0.9953, 1.0115] [0.9955, 1.0046] 1.8 


21.7 Multivariate case 


The antithetic variates idea extends readily to the case where f is a function of 
more than one random variable. For example, suppose we wish to approximate 


I =E(f(U,V,W)), where U, V, W are i.i.d. ~ N(O, 1). 


The standard Monte Carlo estimate is 
M 
Im = 2S U Vi, Wi) with U;, V;, W; i.i.d. ~ N(O, 1) 
M M z lo l9 tIS fa lo Litenese® ’ ’ 


and the antithetic version is 
1 D f (Ui, Vi, Wi) + f (Ui, —Vi, —Wi) 
M 2 : 
with U;, V;, W; i.i.d. ~ N(O, 1). 


Iu = 
i=l 


An extension of the above analysis shows that benefits accrue when f is monotonic 
in each of the arguments. 


21.8 Antithetic variates in option valuation 


The application that we have in mind is, of course, Monte Carlo estimation of 
path-dependent exotic options. In this case we discretize the time interval [0, T] 
and compute risk-neutral asset prices at HL p with 4 =iAt, NAt =T. We 
know that on each increment the price update uses an N(0, 1) random variable 
Z; coming from the i.i.d. sequence {Zo, Z1, ..., Zy—1} according to (19.7). We 
wish to compute the expected value of some payoff function. We are therefore 
looking for the expected value of a function of the N i.i.d. N(O, 1) random vari- 
ables {Zo, Z1,..., Zn—1}. The antithetic variates technique is to take the average 
payoff from one path with samples {Zo, Z1,..., Zy—,} and another path with 
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Asset 


Time 


Fig. 21.1. A pair of discrete asset paths computed using antithetic variates. The 
payoff from both paths is averaged in order to give a single sample. 


samples {— Zo, —Z1,..., —Zn—1}. Where one path zig-zags, the other path zag- 
zigs. Figure 21.1 illustrates such a pair of paths. 


Computational example We value an up-and-in call option with So = 5, E = 
6,r = 0.05, o = 0.3 and T = 1, using a timestep At = 1074, so N = 10*. We 
take B = 8 for the barrier level. Recall from Section 19.2 that 


e the payoff is zero if the asset never attained the price B, that is, if maxjo,7] S(t) < 
B, 

e the payoff is equal to the European call value max(S(T) — E, 0) if the asset at- 
tained the price B, that is, if maxjo,7] S(t) > B. 


Using the ideas from Section 19.6, a basic Monte Carlo strategy can be sum- 
marized as follows: 


fori = 1 to M 
forj=0toN-—1 
compute an N(0, 1) sample &; 
set Sj+1 = Sjef 720 At +o VAt; 
end 
set S™* = maxo<j<n Sj 
if S™* > B set Vi = eT max(Sy — E, 0), otherwise V; = 0 


224 Monte Carlo Part IT: variance reduction by antithetic variates 


end 
set ay = (47) Vi 
set biy = (q7=t) Dini (Vi — am)? 
This gives an approximate option price ay and an approximate 95% confi- 
dence interval (15.5). 
The corresponding antithetic variate version is 


fori = 1 to M 
for j=0toN-1 
compute an N(0, 1) sample £; 
set Sj+1 = Sje 20°) Atto V Atë; 
set Sj41 = 5 je 20At-0 V Att; 
end 
set se = maxg<j<y Sj 
set S; = maxg<j<n Sj 
if S™* > B set V; = eT max(Sy — E, 0), otherwise V; = 0 
if 5S; > B set V; =e’! max(Sy — E, 0), otherwise V; = 0 
set V; = } (V; +V;) 
end 
set ay = + e Vi 
H A 
set bi, = Wo Via (Vi — am)? 


Table 21.3 shows the 95% confidence intervals, and the ratios of their widths, 
for M = 107, 10°, 104, 10°. We see that using antithetic variates shrinks the con- 
fidence intervals by a factor of around 1.5. As mentioned in Section 19.6, the 
overall accuracy of the process depends not only on the error in the Monte Carlo 
approximation to the mean, but also on the error arising from the time discretiza- 
tion — we take the maximum over a discrete set of points rather than over a con- 
tinuous time interval. In this experiment we found that using smaller At values 
did not significantly change the computed results, so the sampling error is dom- 
inant. Ko 


Table 21.3. Ninety-five per cent confidence intervals, plus ratios of their 
widths, for standard and antithetic Monte Carlo on an up-and-in call 


M Standard Antithetic Ratio of widths 
10° [0.0878, 0.3219] (0.1239, 0.3061] 1.3 
10° [0.2285, 0.3333] [0.2238, 0.2936] 1.5 
104 [0.2443, 0.2764] [0.2370, 0.2580] 1.5 


10° [0.2359, 0.2458] (0.2373, 0.2440] 1.5 
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21.9 Notes and references 


The texts (Hammersley and Handscombe, 1964; Madras, 2002; Ripley, 1987) that 
we mentioned in Chapter 15 are good sources of general information about an- 
tithetic variates, and (Boyle et al., 1997; Boyle, 1977; Clewlow and Strickland, 
1998; Jackel, 2002) look at practical issues for option valuation. 


EXERCISES 


21.1. * Show that (21.1) and (21.2) are equivalent and hence conclude that if X 
and Y are independent then cov(X, Y) = 0. 

21.2. x Show that J = 2 in (21.3) and confirm (21.5). 

21.3. x Establish the identity (21.7). [Hints: make use of (3.6) and (3.10) in 
(21.1).] 

21.4. «x Use your favourite scientific computation package to confirm that 
var(Y;) x~ 0.001073 in (21.10). (For example, a suitable approxima- 
tion to the integral Ío ev*+VI-* dx in (21.10) can be obtained from 
>> quadl(’exp(sqrt(x) + sqrt(1-x))’,0,1,1e-9) in MATLAB.) 

21.5. xx Prove the statement involving (21.15). 

21.6. x» Consider the case where f is a monotonic increasing function that is 
extremely expensive to evaluate on a computer — so much so that the cost 
of a sample from a pseudo-random number generator is negligible by com- 
parison. Can we still argue that the antithetic variate estimate (21.13) is at 
least as efficient as the standard one, (21.12)? 

21.7. x» Show that the antithetic estimators (21.13) and (21.19) are exact in the 
case where f is linear, that is, f (x) = ax + £, fora, p € R. What can you 
say about the corresponding confidence intervals? 

21.8. xxx Find a simple example where antithetic variates are less efficient than 
standard Monte Carlo. 


21.10 Program of Chapter 21 and walkthrough 


In ch21, listed in Figure 21.2, we value an up-and-out call option. We use the same parameters as for 
ch19, so we know that the Black-Scholes value is 0.1857. 

The first part of the for loop implements standard Monte Carlo, as in ch19. We then compute 
the payoffs with a negated version of the pseudo-random numbers in samples. The ith entry of the 
array Vanti thus contains the average of the payoffs for the ith asset path and its antithetic twin. 

Running ch21 gives conf = [0.1763, 0.1937] for the Monte Carlo confidence interval. This 
is identical to the interval produced by ch19, because by setting the random number generator to 
the same state with randn(’ state’ ,100), we are using exactly the same samples. The antithetic 
version gives confanti = [0.1807, 0.1921], which is roughly 1.5 times as small as the standard 
Monte Carlo confidence interval. 
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%CH21 Program for Chapter 21 

% 

% Up-and-out call option 

% Uses Monte Carlo with antithetic variates 


randn(‘state’,100) 


VVVV%% Problem and method parameters %%%%%%%%% 
S=5; E=6; sigma = 0.25; r=0.05; T= 1;B=9; 

Dt = 1e-3; N = T/Dt; M = 1e4; 

Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo o Poo P P P P AP A 


V = zeros(M,1); 
Vanti = zeros(M,1); 
fori = 1:M 
samples = randn(N,1); 


% standard Monte Carlo 
Svals = S*cumprod(exp((r-0.5* sigma’2)* Dt+sigma*sqrt(Dt)*samples)); 
Smax = max(Svals); 


if Smax < B 
V(i) = exp(-r*T)*max(Svals(end)-E,0); 
end 


% antithetic path 
Svals2 = S*cumprod(exp((r-0.5*sigma’2)* Dt-sigma*sqrt(Dt)*samples)); 
Smax2 = max(Svals2); 
V2=0 
if Smax2 < B 
V2 = exp(-r*T)* max(Svals2(end)-E,0); 
end 
Vanti(i) = 0.5*(V(i) + V2); 


end 
aM = mean(V); bM = std(V); 
conf = [aM - 1.96*bM/sqrt(M), aM + 1.96*bM/sqrt(M)] 


aManti = mean(Vanti); bManti = std(Vanti); 
confanti = [aManti - 1.96*bManti/sqrt(M), aManti + 1.96*bManti/sqrt(M)] 


Fig. 21.2. Program of Chapter 21: ch21.m. 
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PROGRAMMING EXERCISES 


P21.1. Alter ch21 to the case of a different exotic option. 


P21.2. Type help cov to learn about MATLAB’s covariance function, and apply 
it to the examples studied in this chapter. 


Quotes 


Monte Carlo simulation will continue to gain appeal 
as financial instruments become more complex, workstations become faster, 
and simulation software is adopted by more users. 
The use of variance reduction techniques 
along with the greater power of today’s workstations 
can help to reduce the execution time required for achieving acceptable precision 
to the point that simulation can be used by financial traders to value derivatives in real 
time. 
JOHN CHARNES, ‘Sharper estimates of derivative values’, Financial Engineering 
News, June/July 2002, Issue No. 26 


Even statisticians often fail to treat simulations seriously as experiments. 
BRIAN D. RIPLEY (Ripley, 1987) 


It’s not always easy to tell the difference between understanding 
and brute force computation. 
ROGER PENROSE, source www.apmaths.uwo.ca/ rcorless/ 
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Monte Carlo Part HI: variance reduction by 
control variates 


OUTLINE 


e control variates 
e Asian option example 


22.1 Motivation 


We saw in the previous chapter that the antithetic variates idea relies upon find- 
ing samples that are anticorrelated with the original random variable. In contrast, 
the technique discussed here relies upon finding samples that have some general 
known correlation. This control variate approach is less generic than antithetic vari- 
ates, as it requires some knowledge about the underlying random variables in the 
simulations. However, when it works it can be very powerful. 


22.2 Control variates 


Given that we wish to estimate E(X), suppose we can somehow find another ran- 
dom variable, Y, that is ‘close’ to X with known mean E(Y). Then the random 
variable 


Z=X+E(Y)-Y (22.1) 


satisfies E(Z) = E(X) + E(Y) — E(Y ) = E(X), and hence we could apply Monte 
Carlo to Z instead of X. In this context, Y is called the control variate. 

Since adding a constant to a random variable does not change its variance (Ex- 
ercise 3.6), we see immediately from (22.1) that 


var(Z) = var(X — Y). 


Hence, to get some benefit from this approach, we would like X — Y to have a 
smaller variance than X. This is what we mean above by ‘close’. Note, however, 
that it may be more expensive to sample Z than X. If var(Z) = Rivar(X) for 
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some Rı < 1 and the cost of sampling Z is R2 times that of sampling X, then we 
get an overall gain in efficiency if Ry R2 < 1, see Exercise 22.1. 
We may generalize (22.1) to the case of 


Zo =X+0(E(Y)-Y), (22.2) 


for any 0 € R. Note that we still have E(Zọ) = E(X), so we may apply Monte 
Carlo to Zg. In this case 


var(Z9) = var(X — 6Y) = var(X) — 20 cov(X, Y) + 6?var(Y). 


As 0 varies, the value of @ that minimizes this quadratic is given by 


cov(X, Y) 
Omin := Eaa (22.3) 
Further, we can show that var(Zọ) < var(X) if and only if 0 lies between 0 and 
20min, see Exercise 22.2. 

Of course, on a general problem we typically do not know cov(X, Y ) and hence 
cannot find Omin. However, it is possible to estimate cov(X, Y ), and hence Omin, 
during a Monte Carlo simulation. 

The name ‘control variate’ comes from the fact that the E(Y ) — Y term controls 
the Monte Carlo process. Suppose the covariance is positive, that is, COV(X, Y) := 
E((X — E(X)) (Y — E(Y ))) > O and 0 > 0. In this case, when X is larger than av- 
erage (X > E(X)) we would also expect Y to be larger than average (Y > E(Y)). 
Generally, adding the negative amount 0 (E(Y) — Y) helps to correct the over- 
estimate of E(X) from that sample of X. Similarly when X is smaller than aver- 
age (X < E(X)) we would also expect Y to be smaller than average (Y < E(Y)) 
and adding the positive amount 6 (E(Y) — Y ) helps to correct the underestimate. 
A similar argument applies when cov(X, Y) < Oandé@ < 0. 


Computational example We return to the example from the previous chapter of 


computing J = E(eV¥), where U ~ U(0, 1). For illustration, we take eU as our 


control variate, and use the fact that E(e’) = ie e* dx = e — 1. Since eVY and 


eU are close over [0, 1], we will try the simple 0 = 1 version. Thus the control 


variate Monte Carlo algorithm applies to Z = eVU +e—1—e. Table 22.1 
shows the 95% confidence intervals for the standard and control variate algo- 
rithms, and also the ratios of confidence interval widths. We did four tests, cov- 
ering M = 107, 10°, 10*, 10°, and used the same random number samples for 
the two methods. (Note that the confidence intervals for standard Monte Carlo 
are identical to those in Table 21.1, as we started the random number genera- 
tor at the same point.) We see that the control variate version has confidence 
intervals that are just over 4 times smaller. Separate computations confirm that 
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Table 22.1. Ninety-five per cent confidence intervals with standard and 


control variate algorithm (22.1) for 


E(eVU), plus ratios of their widths 


M Standard Control variate Ratio of widths 
102 [1.8841, 2.0752] [1.9601, 2.0031] 4.4 
10° [1.9538, 2.0087] [1.9951, 2.0084] 4.1 
104 [1.9890, 2.0062] [1.9994, 2.0036] 4.1 
105 [1.9969, 2.0023] [1.9993, 2.0006] 4.1 


Table 22.2. Ninety-five per cent confidence intervals with standard and 


control variate algorithm (22.2) for 


E(eYU), plus ratios of their widths 


M Standard 0-Control variate 0 Ratio of widths 
102 [1.8841, 2.0752] [1.9623, 2.0004] 0.89 5.0 
10° [1.9538, 2.0087] [1.9937, 2.0048] 0.88 4.9 
104 [1.9890, 2.0062] [1.9993, 2.0027] 0.88 5.0 
105 [1.9969, 2.0023] [1.9994, 2.0005] 0.88 5.0 


var(evU — eÏ ) is about 17 times smaller than var(ev¥). Next, we tried the more 
general version based on (22.2). Here, we initially used the U(O, 1) samples from 
the random number generator to estimate Cov(X, Y) and var(Y ), and hence esti- 
mate Omin in (22.3). The samples were then re-used for the Monte Carlo estimate 
of (22.2) with this 6 value. Table 22.2 gives the results, including the 0 values 
that arose. We see that the optimal @ estimates are close to 1, and the extra work 
has only slightly improved the confidence interval widths. © 


22.3 Control variates in option valuation 


The control variate idea can be used on path-dependent options where there is no 
known analytical expression for the option value, but there is an expression for a 
similar option. The classic example is an arithmetic average price Asian option, 
where the average is taken over a pre-set collection of discrete times {t;}_,. As 
described in Section 19.4, the payoff for the arithmetic average price Asian call 
option is 


1 n 
max Eso -£.0) (22.4) 


i=1 
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whereas the corresponding geometric average price Asian option has payoff 


A 1/n 
max (frs) —E,0|. (22.5) 
i=l 


We see that (22.5) differs from (22.4) only in that the arithmetic average has been 
replaced by a geometric average. If the discrete times are equally spaced, t; = i At, 
with At = T /n, then Exercise 19.6 shows that there is an exact formula for the 
geometric average option. However, for the arithmetic average version there is no 
known explicit formula. 

It is reasonable to expect the arithmetic and geometric versions to be well cor- 
related — typically, paths where one option has a large/small payoff should also be 
paths where the other option has a large/small payoff. Because we have the exact 
expression (19.10) for the value (that is, the expected payoff under risk neutrality) 
of the geometric version, we may use this option as a control variate when valuing 
the arithmetic version. 


Computational example We now use Monte Carlo to value the arithmetic av- 
erage price Asian option described above. We take So = 5, E = 6, r = 0.05, 
o = 0.3 and T = 1, and discrete time points At, 2At, ..., nAt, where n = 100, 
so At = 107°. Since we are not interested in the asset prices at any other times, 
we used At as the timestep in the algorithm and computed risk-neutral asset 
prices S(At), SQAt),..., SCN At). Table 22.3 shows the 95% confidence in- 
tervals for standard Monte Carlo and for the alternative that uses the geomet- 
ric average price Asian option as a control variate in the basic formulation 
(22.1). We used M = 107, 10°, 104, 10° samples. We see that the control vari- 
ate improves accuracy by a factor of around eight. In this case, sampling the 
control variate involves relatively little extra work, so the gain in efficiency is 
significant. © 


22.4 Notes and references 


The references (Hammersley and Handscombe, 1964; Madras, 2002; Ripley, 1987) 
deal with the use of control variates in general, and (Boyle et al., 1997; Boyle, 
1977; Clewlow and Strickland, 1998; Jackel, 2002) apply specifically to finance. 
The review paper (Boyle et al., 1997) also discusses a number of other variance 
reduction techniques. 

Because of the representation (3.8), any algorithm for approximating an ex- 
pected value may be thought of as a quadrature method, that is, a method 
for approximating integrals. Quadrature has a long and distinguished history 
in numerical analysis, and many methods have been developed. Monte Carlo 


22.4 Notes and references 


Table 22.3. Ninety-five per cent confidence intervals with 


standard and 0 = 1 control variate algorithm (22.1) for a 


barrier option, plus ratios of their widths 
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M Standard Control Variate Ratio of widths 
10° [0.0283, 0.1161] [0.0885, 0.1010] 7.1 
10° [0.0823, 0.1207] [0.0947, 0.0990] 8.9 
104 [0.0911, 0.1035] [0.0965, 0.0981] 8.2 
105 [0.0968, 0.1007] [0.0973, 0.0978] 8.2 


simulations for path-dependent options, where asset paths are computed at points 
At, 2At,...,NAt =T, correspond to N-dimensional integrals. In this con- 
text, although Monte Carlo is one of the few viable techniques, current re- 
search indicates that algorithms based on so-called low discrepancy sequences 
can be more efficient. Quasi Monte Carlo methods, which combine the effi- 
ciency of low discrepancy sequences with the confidence interval information from 
Monte Carlo, have also been developed recently. The texts (Hull, 2000; Jackel, 
2002; Kwok, 1998) and the survey (Boyle et al., 1997) give pointers to recent 
literature. 

Both variance reduction and hedging share the aim of making a random variable 
more predictable, and this connection can be exploited in practice, see (Clewlow 
and Strickland, 1998), for example. 


EXERCISES 


22.1. x» Confirm that if var(Z) = Rıvar(X) for some Rı < 1 and the cost of 
sampling Z is R2 times that of sampling X, then we get an overall gain in 
efficiency from applying Monte Carlo to (22.1) if Ri R2 < 1. 

22.2. x» Show that var(Zg) < var(X) if and only if 6 lies between 0 and 26min, 
where Omin is defined in (22.3). (Note that Onin may be negative.) 

22.3. x x x (This exercise relates to Section 15.4, but fits in with the general theme 
of variance reduction.) Suppose that a random variable V depends on some 
deterministic parameter, p, and we wish to compute 


E (V (p +h)) -E(WV(p)) 
h 


for some small increment h. Consider the following Monte Carlo ap- 
proaches: 
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Method 1 


(a) apply Monte Carlo to give an approximation A ~ E (V (p + h)), 

(b) apply Monte Carlo to give an approximation B ~ E (V (p)) (using a different 
pseudo-random number sequence from that in (a)), 

(c) compute (A — B)/h as the overall approximation. 


Method 2 


(a) apply Monte Carlo to give an approximation C ~ E(V (p +h) — V(p)), 
(b) compute C/h as the overall approximation. 


Using (21.7), explain why Method 2 is likely to be more successful than 
Method 1. 


22.5 Program of Chapter 22 and walkthrough 


In ch22, listed in Figure 22.1, we do a control variate computation of the type reported in Table 22.3. 
Our task is to value an arithmetic average price Asian option using the geometric average price 
Asian, which has a Black-Scholes formula, as control variate. After initializing the parameters, we 
evaluate the formula (19.10) for the geometric version. Next, we compute an M by L array Spath, 
whose ith row represents the ith asset path at times 0, At, 2At, ..., T. The standard Monte Carlo 
method is then applied. The command mean (Spath, 2) evaluates the sample mean over the second 
index; this returns an M by 1 array whose ith entry is the sample mean over the ith row of Spath. The 
quantity exp(-r*T) *max(arithave-E,0) then represents the array whose ith entry is the payoff 
of the arithmetic average price Asian option from the ith path. The Monte Carlo confidence interval 
for these payoff samples turned out to be confmc = [0.2479,0.2631]. 
For the geometric average price Asian control variate, we must evaluate the quantity 


N 1/N 
(11 sw) 


i=1 


To eliminate the possibility of under/overflow in the evaluation of the product mA 1 S(t; ) itis prudent 
to implement the equivalent form 


1 N 
exp (> X log sa») . 


i=l 
The variable geoave gives the pathwise geometric average and Pgeo then stores the payoffs — these 
are our control variate samples. The array Z thus contains samples corresponding to Z in (22.1). The 


resulting confidence interval is confcv = [0.2576, 0.2584]. This is nearly 20 times smaller than 
confmc. 


PROGRAMMING EXERCISES 
P22.1. Alter ch22 so that the 0 version (22.2) is used. 


P22.2. Test whether it is worthwhile to combine the antithetic and control variates 
techniques. 


22.5 Program of Chapter 22 and walkthrough 


%CH22 Program for Chapter 22 

% 

% Monte Carlo on an arithmetic average price Asian option 
% using a geometric average price Asian as control variate 


randn(‘state’,100) 


MWHVL%V% Problem and method parameters %%%%%%%%% 
S = 4; E = 4; sigma = 0.25; r = 0.03; T = 1; 

Dt = 1e-2; N = T/Dt; M = 1e4; 

Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo 


WWAVVVW%% Geom Asian exact Mean %%%%%%%%W%V%% 
sigsqT= sigma’2*T*(N+1)*(2*N+1)/(6*N*N); 
muT = 0.5*sigsqT + (r - 0.5* sigma’2)*T*(N+1)/(2*N); 


d1 = (log(S/E) + (muT + 0.5*sigsqT))/(sqrt(sigsqT)); 
d2 = d1 - sqrt(sigsqT); 


N1 = 0.5*(1+erf(d1/sqrt(2))); 
N2 = 0.5*(1+erf(d2/sqrt(2))); 


geo = exp(-r*T)*( S*exp(muT)*N1 - E*N2 ); 
Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo o PPP PP A 


Spath = S*cumprod(exp((r-0.5*sigma^2)*Dt+sigma*sqrt(Dt)*randn(M,N)),2); 


% Standard Monte Carlo 

arithave = mean(Spath,2); 

Parith = exp(-r*T)*max(arithave-E,0); % payoffs 

Pmean = mean(Parith); 

Pstd = std(Parith); 

confmc = [Pmean-1.96*Pstd/sqrt(M), Pmean+1.96*Pstd/sqrt(M)] 


% Control variate 

geoave = exp((1/N)*sum(log(Spath),2)); 

Pgeo = exp(-r*T)*max(geoave-E,0); % geo payoffs 

Z = Parith + geo - Pgeo; % control variate version 
Zmean = mean(Z); 

Zstd = std(Z); 

confcv = [Zmean-1.96*Zstd/sqrt(M), Zmean+1.96*Zstd/sqrt(M)] 


Fig. 22.1. Program of Chapter 22: ch22.m. 
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Quotes 


Simulation has a colourful language, and variance reduction techniques, 
especially clever ones, are often known as swindles. 
Presumably it is nature that is being swindled, 
but she frequently gets her own back. 
Variance reduction swindles quite frequently do not work, 
especially when more than one idea is tried simultaneously. 
BRIAN D. RIPLEY (Ripley, 1987) 


The antithetic method is easy to implement, 
but often leads to only modest error reductions. 
... The control variate technique can lead to very substantial error reductions, 
but its effectiveness hinges on finding a good control for each problem. 
PHELIM BOYLE, MARK BROADIE AND PAUL GLASSERMAN (Boyle et al., 1997) 


Spock: Random chance seems to have operated in our favor. 
McCoy: In plain non-Vulcan English, we’ve been lucky! 
Spock: I believe I said that, doctor. 
From STAR TREK, source http://us.imdb.com/Quotes?0060028 


Never let the continuous progress of CPU speeds 
and processing power 
be an excuse for ill-thought-out algorithm design. 
PETER JACKEL (Jackel, 2002) 
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Finite difference methods 


OUTLINE 


finite difference operators 
FTCS and BTCS 
local accuracy 


e von Neumann stability 
e convergence 
e Crank—Nicolson 


23.1 Motivation 


In Chapter 8 we obtained the Black-Scholes formula for a European call option 
by first deriving the PDE (8.15)-(8.18) and then displaying its analytical solution 
(8.19). Chapters 18 and 19 showed that the values of other options may also be 
characterized via the solutions to PDEs. In general, the PDEs that arise for valuing 
exotic options cannot be solved analytically. However, it is possible to compute 
approximate solutions. This chapter introduces finite difference methods, which 
represent the most popular computational approach. We have already come across 
the underlying idea, that of discretization, a number of times in this book. Here, we 
develop three widely used finite difference schemes for the basic heat equation and 
discuss their key properties. The next chapter focuses on the use of finite difference 
technology for option valuation. 


23.2 Finite difference operators 


Given a smooth function y : R —> R, we know from the definition of a derivative 
that, for small h, 

ya +h) ye) _ dy 

h dx 


(x). 
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Table 23.1. Difference operators 


Operator Symbol Definition Taylor series 

Forward difference A Ym+1 — Ym hy + th? y" +... 

Backward difference V Ym — Ym-1 hy’ — ik?y" +... 

Half central difference ô ppt E, hy'— 4 hey" +... 

Full central difference Ao 4(¥m+1 — Ym-1) hy’ + th? y” os, 

Second order central 8? Ym41 — 2Ym + ym-1 hy" = bhy” Pu 
difference 

Shift E Ym+1 y+ ols 

Average u n Æ Wad) y+ th’y 


If we let y,, denote the value y(mh) then this may be written 
d 
Ym+1 — Ym © h> (mh). (23.1) 
dx 


To ease the notation, we will use a prime, ’, to denote a derivative, so y’ means 
dy/dx and y” means d?y/dx?, and assume that functions are evaluated at x = mh 
unless otherwise stated. With the aid of a simple Taylor series expansion, we may 
extend (23.1) to 


Ym+1 — Yn = hy’ + shy" +e: 


The quantity Ym+1 — Ym is known as a forward difference and the associated for- 
ward difference operator, A, is defined by 


Aym = Ym+1 — Ym- 


(This is, of course, not to be confused with the delta of an option. In this chapter A 
will exclusively denote the forward difference operator.) In Table 23.1 we define a 
number of finite difference operators. These operators, which act on grid values 
Ym = y(mh), form the main building blocks of finite difference methods. The 
table also gives the first two terms in the corresponding Taylor series expansions; 
Exercise 23.2 asks you to verify them. Two of those definitions involve ‘half-way’ 
values, y,,4.1 = y((m + DA). 


23.3 Heat equation 
We will focus on a simple mathematical problem. Our task is to find a function of 
two variables, u(x, t), that satisfies the PDE 
ðu 3u 


Ay pa for O<x <L and 0<t<T, (23.2) 
x 
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subject to the initial condition 
u(x, 0) = g(x) (23.3) 
and the boundary conditions 
u(O, t) = a(t) and u(L,t) = b(t). (23.4) 


The PDE (23.2) is known as the heat equation, because u(x, t) describes the tem- 
perature at time ¢ of the point x on a thin metal bar with initial temperature profile 
(23.3) and with endpoints heated according to (23.4). There are two good reasons 
for focusing on this PDE. 


(i) It allows us to develop the ideas behind finite difference methods in a simple setting. 

(ii) The basic Black-Scholes PDE (8.15) can be translated into an equation of this form; 
Section 23.9 gives references. In Section 24.4 we will go part of the way towards that 
translation. 


We will follow the usual convention of regarding x as space and f¢ as time. (In 
the next chapter, however, x will correspond to asset price.) 
In the case L = x with 


g(x) = sin(x), a(t) = b(t) =0, (23.5) 
it is easy to verify that 
u(x,t) = e™ sin(x) (23.6) 


solves (23.2), (23.3) and (23.4). In Figure 23.1 we plot the solution (23.6). This 
will be used for reference when we derive finite difference methods. 


23.4 Discretization 


In computing an approximate solution to the PDE (23.2), (23.3) and (23.4), our first 
step is to discretize. We have already used the idea of discretization in a number of 
contexts: 


e development of an asset price model in Chapter 6, 
e derivation of the binomial method in Chapter 16, 
e extension of Monte Carlo to path-dependent option valuation in Chapter 19. 


The plan is to compute approximations to the PDE solution only at a finite set 
of points. We divide the space axis into Nx + 1 equally spaced points { pny, 
where h=L/N,, and the time axis into N;+1 equally spaced points 


{ik}, where k = T/N;. The points (jh, ik) form what is called the grid, or 
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Fig. 23.1. Heat equation solution u(x, t) for (23.2), (23.3) and (23.4) with initial 
and boundary conditions (23.5). 


the mesh. We seek values U i that approximate the solution on the grid, 
Ui ~ u(jh, ik), O<j<MN, and 0<i<N,. 


This notation is consistent with that in Chapter 16 in the sense that a superscript 
is used to denote the time level. Figure 23.2 illustrates the grid. The open circles 
indicate grid points where the solution is not yet known. Points where the initial 
condition (23.3) and boundary conditions (23.4) can be used to determine the so- 
lution are marked with filled circles. Hence, our task is to find numbers to put into 
the points marked o. We will do this by using finite difference operators to form 
equations that the grid values U i must satisfy. 


23.5 FTCS and BTCS 


The key step in deriving finite difference methods is to replace differential 
operators with finite difference operators. Our problem domain involves two 
independent variables, 0 < x < L and O < ¢t < T , and hence we must distinguish 
between difference operators in the x- and t-directions. We do this with a subscript, 
so, for example, 


A;U; = Uit! = Ui and AUi = yi 


i 
joi 7 Uj. 
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Fig. 23.2. Finite difference grid {jh, i 5 ue Points are spaced at a distance 


of h apart in the x-direction and k apart in the f-direction. 


A simple method for the heat equation (23.2) involves approximating the time 
derivative 3/ðt by the scaled forward difference in time, k~!A,, and the second 
order space derivative 07/4x? by the scaled second order central difference in 
space, aad . This gives the equation 


-1 i =2 277i 
k AU; —h ôU; =0, 
which may be expanded as 


i+1 i i i i 
U; =U Ua U a 
k h2 7 


A more revealing re-write is 


Ut = Ui + (1 — 2v)U} + vU}, (23.7) 


where v := k/h? is known as the mesh ratio. 

Suppose that all approximate solution values at time level i, {U ee are 
known. Now note that Uir =a((i + 1)k) and Ue = b((i + 1)k) are given by 
the boundary conditions (23.4). Equation (23.7) then gives a formula for comput- 


ing all other approximate values at time level 7 + 1, that is, {U 5 Lea Since we 
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Fig. 23.3. Stencil for FTCS. Solid circles indicate the location of values that 
must be known in order to obtain the value located at the open circle. 


are supplied with the time-zero values, U; 0 — = g(jh) from (23.3), this means that 


the complete set of approximations {Ui } Nr = o can be computed by stepping for- 


ward in time. The method defined by (23. 7) i is known as FTCS, which stands for 
forward difference in time, central difference in space. Figure 23.3 illustrates the 
stencil for FTCS. Here, the solid circles indicate the location of values U; 4) U; i 
and Ui , that must be known in order to obtain the value U; ‘+! located at the open 
e 

We may collect all the interior values at time level 7 into a vector, 


ra] 
PA 


Exercise 23.3 then asks you to confirm that FTCS may be written 


RM], (23.8) 


Ut! = FU +p’, for0 <i <N,-1, (23.9) 


with 


€ RTL, 


=æ 
© 
Il 


Fo = sm 
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where the matrix F has the form 


1-2v v 0 0 
v l-2 v 0 
P= 0 E ER om me ; | e raono, 
| : vate Mes, E eg 0 | 
*,1-2v v 
0 . 0 v l—=2v 
and the vector p’ has the form 
va(ik) 
0 
A | ere 


| 
EA 


Here, FU! denotes a matrix-vector product. 


Computational example Figure 23.4 illustrates a numerical solution produced 
by FTCS on the problem of Figure 23.1, with T = 3. We chose N, = 14 and 
N, = 199, so h = x/14 © 0.22 and k = 3/199 ~ 0.015, giving v % 0.3. The 
numerical solution appears to match the exact solution, shown in Figure 23.1. 
Computing the worst-case grid error, maXo< j<N,,0<i<N, |U i — u(jh, ik)|, pro- 
duced 0.0012, which confirms the close agreement. As can be seen from 
Figure 23.4, we used a grid where k is much smaller than h — we divided the x- 
axis into only 15 points, compared with 200 points on the t-axis. In Figure 23.5 
we show what happens if we try to correct this imbalance. Here, we reduced N; 
to 94, so k ~ 0.032 and v © 0.63. We see that the numerical solution has de- 
veloped oscillations that render it useless as an approximation to u(x, t). Taking 
smaller values of N;, that is, larger timesteps k, leads to more dramatic oscilla- 
tions. In Section 23.7 we develop some theory that explains this behaviour. We 
finish this section by deriving an alternative method that is more computationally 
expensive, but does not suffer from the type of instability seen in Figure 23.5. > 


Replacing the forward difference in time in FTCS by a backward difference 
gives 


k-V,Ui, — h-782U', = 0, 
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FTCS: v = 0.3 


t x 


Fig. 23.4. FTCS solution on the heat equation (23.2), (23.3) and (23.4) with 
initial and boundary conditions (23.5). Here N, = 14 and N; = 199, so v © 0.3. 


or, in more detail, 


a ee Dans 
Ui -Uj Uy WU | 


k h2 


It is convenient to write this as a process that goes from time level i to i + 1, that 
is, to increase the time index by 1, which allows the method to be written 


uit) =U} +v (uiti -20i tui). (23.10) 


The method defined by (23.10) is known as BTCS, which stands for backward 
difference in time, central difference in space. Figure 23.6 illustrates the stencil for 
BTCS. Unlike FTCS, with BTCS there is no explicit way to compute {U Ee 
from {U TAn Using the vector notation (23.8), Exercise 23.4 asks you to show 
that the recurrence (23.10) for BTCS may be written 


BU! =U! +q, for0<i<N,-1, (23.11) 
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FTCS: v = 0.63 


15 


x 


Fig. 23.5. FTCS solution on the heat equation (23.2), (23.3) and (23.4) with 
initial and boundary conditions (23.5). Here Ny = 14 and N; = 94, so v © 0.63. 


where the matrix B has the form 


1+2v —v 0 en 0 
—v 1+2v —v 0 
gal 2 k | e RADM), 
| ; = 0 | 
| —v 142v =v | 
| 0 0 —v 1+ 2v | 
(23.12) 
and the vector qt has the form 
va((i + 1)k) 
e RMTI, 


if] 
‘ain 


vb((i + Dk) 
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Fig. 23.6. Stencil for BTCS. Solid circles indicate the location of values that 
must be known in order to obtain the value located at the open circle. 


The formulation (23.11) reveals that, given U', we may compute U't! by solv- 
ing a system of linear equations. This is a standard problem in numerical analysis, 
see Section 23.9 for references. 


Computational example Figure 23.7 gives the BTCS numerical solution for 
the problem in Figure 23.1, with T = 3. We used N, = 14 and N, = 9, so 
h=7/14 0.22 and k = 3/9 ~ 0.33, giving v © 6.6. The numerical solution 
agrees qualitatively with the exact solution in Figure 23.1, and we found that 
the worst-case grid error, maxg< j<N,,0<i<N, |U i — u( jh, ik)|, was a respectable 
0.055. Ko 


23.6 Local accuracy 


It is intuitively reasonable to judge the accuracy of a finite difference method by 
looking at the residual when the exact solution is substituted into the difference for- 
mula. For FTCS, letting ui, denote the exact solution u(jh, ik), the local accuracy 
is defined to be 


Ri = kT Apu, = mae ie (23.13) 


Using the Taylor series results in Table 23.1, this may be expanded as 


ðu ð u 2 au 5 04u 4 
R= —= = ah — h 
i (+ uit +0) (+4 aye eK i) 


where all functions 9u/ðt, 0°u/dt?, etc., are evaluated at x = jh, t = ik. Since u 
satisfies the PDE (23.2), we have 


a7u TE 


ae ak atO + O18"). (23.14) 
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BTCS: v = 6.6 


Fig. 23.7. BTCS solution on the heat equation (23.2), (23.3) and (23.4) with 
initial and boundary conditions (23.5). Here N, = 14 and N, = 9, so v © 6.6. 


The expansion (23.14) shows that the local accuracy of FTCS behaves as O (k) + 
O (h?). Hence, FTCS may be described as first order in time and second order in 
space. 

For BTCS, the local accuracy is defined as 


Ri := k`! Vui, — h° szut. (23.15) 
In this case it is convenient to use Taylor series results from Table expansion about 
time level (i + 1)k, and we find that 


. a7u atu 
pte 1 1.72 
R; = — ak a2 iat 3x4 


+ O(k?) + O(n), (23.16) 


with the functions evaluated at x = jh, t =ik. Exercise 23.5 asks you to fill 
in the details. This shows that BTCS has the same order of local accuracy as 
FTCS. 


23.7 Von Neumann stability and convergence 


A fundamental, and seemingly modest, requirement of a finite difference method 
is that of convergence — the error should tend to zero as k and h are decreased to 
zero. It turns out that convergence is quite a subtle issue. One aspect that must be 
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addressed is the choice of norm in which convergence is measured; in the limit 
k — 0,h — 0, we are dealing with infinite-dimensional vector spaces, so we lose 
the property that “all norms are equivalent’. 

There is, however, a wonderful and very general result, known as the Lax Equiv- 
alence Theorem, which states that a method converges if and only if its local ac- 
curacy tends to zero as k —> 0, h — 0 and it satisfies a stability condition. The 
particular stability condition to be satisfied depends on the norm in which conver- 
gence is measured. We do not have the space to go into any detail on this matter, 
but readers with a feel for Fourier analysis may appreciate that the following sta- 
bility definition is related to the L2 norm. 


Definition A finite difference method generating approximations U i is stable in 
the sense of von Neumann if, ignoring initial and boundary conditions, under the 
substitution ui = é'elF/" it follows that! |£] < 1 for all Bh € [—z, z]. Here i 
denotes the unit imaginary number. © 


To illustrate the idea, taking FTCS in the form (23.7) and substituting U; = 
E‘elBIN gives 
gitleibjh — pgi Pih ibh 4 G wéi ebin 4 yet ePi eibh, 
So 
= velbh 4 (1 — 2v) + ve ihr 
1+v Ci —2+ a") 


=a (cle = grizan) 


w 
l 


yer (2i sinen) 
= 1 — 4v sin? (4h). 
The condition |&| < 1 thus becomes 
|1 — 4v sin?(5Bh)| < 1, 
which simplifies to 
0 < vsin?’ ($h) < 4. 


For Bh € [—7, x ] the quantity sin? (G Bh) takes values between 0 and 1, and hence 
stability in the sense of von Neumann for FTCS is equivalent to 


vs 


(23.17) 


NI- 


! A more general definition allows || < 1 + Ck for some constant C, but our simpler version suffices here. 
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Returning to our previous computations, we see that a stable value of v ~ 0.3 
was used for FTCS in Figure 23.4, whereas Figure 23.5 went beyond the stability 
limit, with v 0.63. In practice, FCTS is only useful for v < 5. If we consider 
refining the grid, that is reducing A and k to get more accuracy, then we do so while 
respecting this condition. It is typical to choose v, say v = 0.45, and consider the 
limit h — 0 with fixed mesh ratio k/h? = v. In this regime, k tends to zero much 
more quickly than A. 

Exercise 23.6 asks you to show that BTCS is unconditionally stable, that is, 
stability in the sense of von Neumann is guaranteed for all v > 0. This is consis- 
tent with Figure 23.7, where a relatively large value of v did not give rise to any 
instabilities. 


23.8 Crank-Nicolson 


We have seen that FTCS and BTCS are both of local accuracy O (k) + O(h?). 
The O (k) accuracy in time arises from the use of first order forward or backward 
differencing in time. The Crank—Nicolson method uses a clever trick to achieve 
second order in time without the need to deal with more than two time levels. 

To derive the Crank—Nicolson method, we temporarily entertain the idea of an 
intermediate time level at (i + J)k. The heat equation (23.2) may then be approx- 
imated by 


41 41 
EtU — neu"? =0. 


This finite difference formula has an appealing symmetry. However, we have intro- 
duced points that are not on the grid. We may overcome this difficulty by applying 
the time averaging operator, ur, on the right-hand term, to get a new method 


mel 4d 
713,07"? — h?m U? =0, 
that is 
kit —Ui)—h iuit + UF j= 
This may be written as 


2+ vit = vUT + vt) + U5, +20 — vU} + vU} y. (23.18) 


This is Crank—Nicolson. The stencil is shown in Figure 23.8. Because of its inher- 
ent symmetry, the method has local accuracy O (k?) + O(h?). Exercise 23.8 asks 
you to confirm this. Crank—Nicolson has two features in common with BTCS. 
First, it is implicit, requiring a system of linear equations to be solved in order to 
compute U'*! from U'Ż. The equations may be written 
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Fig. 23.8. Stencil for Crank—Nicolson. Solid circles indicate the location of val- 
ues that must be known in order to obtain the value located at the open circle. 


0 


BUT! = FU +r, for0<i <N,-1, (23.19) 
where the matrices B and F have the form 
es -4v 0 PAi Sa 0 J 
| —}v l+ov -4v 0 ar | 
S| 0 one oe et ey. : | eR Dx N-D, 
: "a Te —5v 1+v -4v 
0 nA oat 0 -4v l1+v 
lee iv DD ae ee, 0 4 
| xv l-—v xv 0 oh : 
Pe 0 ice a c RN Dx N-D 


| 
| 
$v l-v 2 | 


and the vector r! has the form 


k + 1)k)) 


| e RM! 5 


| 
l, Sv (b(ik) + E ET) 
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see Exercise 23.9. Second, it is stable in the sense of von Neumann for all v > 0, 
see Exercise 23.10. The extra order of local accuracy in time makes it a popular 
choice. Exercise 23.11 gives an alternative derivation of the method. 


Computational example Recall that the BTCS computation in Figure 23.7 pro- 
duced a worst-case grid error of 0.055. Switching to Crank—Nicolson, we find 
that the error reduces to 0.0019, which reflects the higher order of local accuracy 
in time. Ko 


23.9 Notes and references 


This chapter was designed to give only the most cursory introduction to finite 
differences. Excellent, accessible texts that give much more detail and, in par- 
ticular, describe methods for solving the linear systems such as (23.11) and 
(23.19), and also do justice to the Lax Equivalence Theorem, include (Iserles, 
1996; Mitchell and Griffiths, 1980; Morton and Mayers, 1994; Strikwerda, 
1989). A freely available work of similarly high quality is the unpublished 
text, Finite Difference and Spectral Methods for Ordinary and Partial Differ- 
ential Equations, 1996, by Lloyd N. Trefethen, which is downloadable from 
http://web.comlab.ox.ac.uk/oucl/work/nick.trefethen/pdetext.html. 

Details of how to transform the Black—Scholes PDE (8.15) into standard heat 
equation form (23.2) can be found, for example, in (Nielsen, 1999, Section 6.7) 
and (Wilmott et al., 1995, Section 5.4). 

Finite difference methods represent the most conceptually straightforward ap- 
proach to solving a PDE numerically, and they appear to be the most popular 
choice in the mathematical finance community. However, it is worth pointing out 
that there are other areas of science and engineering where numerical methods for 
PDEs have reached a greater level of maturity, and in many cases other techniques, 
most notably finite element methods, have found considerable favour. 


EXERCISES 


23.1. x» Show that AV = VA; that is, for any sequence {ym}, AV Ym = V Aym. 
Similarly, establish the following identities relating finite difference oper- 


ators: 
AV=V-A, 
AV = 8, 
Ao = uô, 


Ao = 6p, 
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23.2. 
23.3. 
23.4. 
23.5. 
23.6. 


23.7. 


23.8. 


23.9. 


23.10. 


23.11. 


Finite difference methods 


A? = 8E, 
A? = E8. 


xx Verify the Taylor series expansions in Table 23.1. 

x Verify that FTCS, (23.7), may be written in the form (23.9). 

x Verify that BTCS, (23.10), may be written in the form (23.11). 

xx Using Table 23.1, show that the local accuracy of BTCS, defined in 
(23.15), satisfies (23.16). 

x x x By copying the analysis that led to (23.17), show that BTCS is stable 
in the sense of von Neumann for all v > 0. 

xx Show that Crank—Nicolson, (23.18), can be expressed as 


(1 - vz) uit = (1 + fv?) U}. 


x x x By analogy with (23.13) and (23.15), define the local accuracy for 
Crank—Nicolson and show that it is O (k?) + O(h7). 

xx Verify that Crank—Nicolson, (23.18), may be written in the form 
(23.19). 

xxx Show that a von Neumann stability analysis of Crank—Nicolson, 
(23.18) leads to 


_ 1-2» sin? (5 Bh) 
1 + 2vsin?(4Bh) 


§ 


Deduce that the method is stable for all v > 0. 
xx Suppose we take the average of the FTCS equation (23.9) and the 
BTCS equation (23.11) to get 


z0 + BU! = 3U + F)U' + 5(p' +a’). 
Show that this method is Crank—Nicolson. (The second order accuracy 


in time may now be understood by observing that averaging the local 
accuracy expansions (23.14) and (23.16) causes the O (k) term to vanish.) 


23.10 Program of Chapter 23 and walkthrough 


The program ch23 implements BTCS for the heat equation (23.2) with initial and boundary con- 
ditions (23.5), and plots the solution in the style of Figure 23.7. It is listed in Figure 23.9. After 
initializing parameters, we set up the Nx-1 by Nx-1 array B, which has the form displayed in (23.12). 
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%CH23 Program for Chapter 23 
% 
% Backward time central space (BTCS) for heat eqn 


clf 

PVWVVVVVVVV%%% Parameters HVVVVWVVV%VV%V%%%% 

L = pi; Nx = 9; dx = L/Nx; 

T = 3; Nt = 19; dt = T/Nt; nu = dt/dx°2; 

Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Vo Yo Yo Yo Yo Yo Yo Yo Vo Yo Yo Yo 


B = (1+2*nu)*eye(Nx-1,Nx-1) - nu*diag(ones(Nx-2,1),1) - nu*diag(ones(Nx-2,1),-1); 
U = zeros(Nx-1,Nt+1); 
U(:,1) = sin([dx:dx:L-dx]’); 


fori = 1:Nt 
x = B\U(:,i); 
U(:,i+1) = x; 
end 


bc = zeros(1,Nt+1); 
U = [bc;U;bc]; 


mesh(U’) 
xlabel(’x’,/FontSize’,20’) 
ylabel(’t’,/FontSize’,20’) 


Fig. 23.9. Program of Chapter 23: ch23.m. 
This is done with eye, diag and ones. The command eye (Nx-1,Nx-1) sets up an identity matrix 


fa 0" tei eee he 6 
0 1 0... ... 


0 
c RWr-Dx(Ne-D 


i 
DEI 
OERE F) 
[i] 
! 


is created by ones (Nx-2,1) and used in the diag function. Generally, diag(v,k) creates a two- 


The array 


e RNx-2 


dimensional array with v placed down the kth sub-/super-diagonal and zeros elsewhere. In our case, 
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diag(ones(Nx-2, 1),1) and diag(ones(Nx-2, 1) ,-1) correspond to 


fo 1 0... ... 0 [SO SOE- vac Wee? dak r 

00 1 0...0 i En oe o| 

0 s | ihá 0 1 0 ae E : | € ROx-Dx(Ny-D)_ 
0 ; Ten i 

. mil 

[Oa Fags 0s 20) o] O a 0 1 0 


respectively. The Nx-1 by Nx-1 array U is used to store the numerical solution; successive columns 
hold the solution U' in (23.8) at successive time levels. The initial condition is inserted into the first 
column with U(:,1) = sin([dx:dx:L-dx]’);. We then enter a for loop that steps forward in 
time. Generally, if A and b are compatible two- and one-dimensional arrays, respectively, then A\b 
computes the solution x to the linear system A*x = b. It follows that the line x = B\U(:,i); solves 
the required system (23.11), and U(: ,i+1) = x; assigns this solution to the next column of U. Note 
that qi = 0 in (23.11) because of the zero boundary conditions. The line U = [bc;U; bc]; pads out 
U by adding a row of zeros at the top and bottom, corresponding to those zero boundary conditions. 


PROGRAMMING EXERCISES 


P23.1. Using colon subarray notation, as in ch16, or otherwise, alter ch23 so that 


FTCS is used. Toy with the stability constraint v < 7 


P23.2. Implement Crank—Nicolson on the heat equation and compare its accuracy 
with that of FTCS and BTCS. 


Quotes 


In order to solve this differential equation 
you look at it till a solution occurs to you. 
GEORGE POLY Á, 1887-1985, source http://math.furman.edu/~mwoodard/mquot.html 


Numerical theory for PDEs of evolution is sometimes presented in a deceptively simple 
way. 
On the face of it, nothing could be more straightforward: 
discretize all spatial derivatives by finite differences 
and apply a reputable ODE solver, 
without paying heed to the fact that, actually, 
one is attempting to solve a PDE. 
This nonsense has, unfortunately, taken root in many textbooks and lecture courses, 
which, not to mince words, propagate shoddy mathematics and poor numerical practice. 
Reputable literature is surprisingly scarce, 
considering the importance and depth of the subject. 
ARIEH ISERLES (Iserles, 1996) 
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Spelling note #1: the name is ‘Nicolson’, not ‘Nicholson’. 
LLOYD N. TREFETHEN, Finite Difference and Spectral Methods for Ordinary and 
Partial Differential Equations, 1996; see Section 23.9. 


24 
Finite difference methods for the Black—Scholes PDE 


OUTLINE 


e Black-Scholes PDE in reverse time 

e initial and boundary conditions 

e FTCS, BTCS and Crank—Nicolson 

e binomial method as a finite difference method 


24.1 Motivation 


The previous chapter introduced finite difference methods. Here, we apply this 
idea to the Black-Scholes PDE. This is not entirely straightforward because the 
PDE is slightly more general than the heat equation used in Chapter 23 and the 
boundary conditions are not quite so convenient. 


24.2 FTCS, BTCS and Crank-Nicolson for Black-Scholes 


The Black-Scholes PDE (8.15) is typically augmented with a final time condi- 
tion — examples that we have seen include (8.16), (8.25), (17.1) and (19.2). Since 
convention (and every book on numerical PDEs) dictates that problems should be 
specified in initial time condition form, we make the change of variable t = T — t. 
In this way t represents the time to expiry and runs from T to 0 when ¢ runs from 
0 to T. Under this transformation the Black-Scholes PDE (8.15) becomes 
2 

y a = rsk +rV =0. (24.1) 

In this section we focus on European calls and puts. The £ = T condition for a 
European call, (8.16), becomes the t = 0 condition 


1202 
=50° 8 


C(S, 0) = max(S(0) — E, 0). (24.2) 
Similarly, the European put condition (8.25) changes to 


P(S,0) = max(E — S(O), 0). (24.3) 
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Turning to boundary conditions, the European call and put involve the PDE on the 
domain S € [0, oo]. This presents a difficulty. We must represent this range by a 
finite set of points. A reasonable fix is to truncate the domain to S € [0, L], where 
L is some suitably large value. Using (8.17) and (8.18), this gives call boundary 
conditions 


C(0,t) =0 and C(L, tI) =L. (24.4) 
Similarly, from (8.26) and (8.27) we obtain 
P(O,t) = Ee! and P(L,t)=0 (24.5) 


for a European put. 
We are now able to use a grid { jh, Uh See. as shown in Figure 23.2. Letting 


vi 
V5 


denote the numerical solution at time level i, we have V° specified by the ini- 
tial data (24.2) or (24.3) and the boundary values vi and vi, forall <i <N; 
specified by the boundary conditions (24.4) or (24.5). 

To obtain a generalized version of FTCS for the PDE (24.1) we use the full 
central difference operator from Table 23.1 for the ƏV /ð S term and evaluate the 
V term at (jh, ik) to get the difference equation 


yitl _ yi vii, -wvitvi_, vi -vi 
Aie a L sai = “2 : rjh H Ja +rV; =0. 
(24.6) 
The corresponding generalization of BTCS is 
i f i+1 i+] i+1 
PVE Sega Ya) 
p nN re 
il yi fl 
; j+1 j—1 j 
= rjh (5 =] + ner = 0. (24.7) 


The matrix—vector representation of FTCS in (23.9) remains valid if we re- 
define 


F = (1 —rk)I + xko7DoT2 + 5kr DiT 
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and 
sk(o* —r)V4 
0 
p= | | 
| S 
E T nvi, | 
where 


and 
0 1 0 1 0 we O 
-1 0 1 oy a ae 
T,= 0 1 a ee E | 
w sE 
—1 0 1 ty “oe i 
0 0 -=1 0 0 0 1 -2 


Similarly, BTCS has the form (23.11) with 
B = (14+rk)I — tko’ DoT) — Łkr Di Ty 


Tk(o? —r)vį"! | 
0 

| | 

| | 

| 


| 
| : | 
EOE I 


and 


> 


see Exercise 24.1. 
One way to generalize the Crank—Nicolson scheme (23.18) is to adopt the view- 
point of Exercise 23.11 and take the average of the FTCS and BTCS formulas 
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(23.9) and (23.11) to give 
z + BV! = 50+ F)V +3 +a’). (24.8) 


Computational example We used our three finite difference methods to value 


a European put option with parameters E = 4, ø = 0.3, r = 0.03 and T = 1. 
We truncated the asset range at L = 10. Since the exact value is known from 
the Black-Scholes formula (8.24), we may check the error. We focused on the 
maximum error at time zero: 
er? := max |V" —-V(jh,t =T). (24.9) 
lsj<Nx-1 / 

With N, = 50 and N; = 500, so k =2 x 107? and h = 0.2, we found that 
err? = 1.5 x 107° for FTCS and err® = 1.7 x 107° for BTCS. With Crank- 
Nicolson we were able to reduce N; to 50, so k = 2 x 107°, and still get a 
comparable error, err? = 1.6 x 1073. Ko 


Our treatment of stability and convergence of finite difference methods in 
Chapter 23 does not carry through directly to this section, since the PDE (24.1) 
has nonconstant coefficients and includes a first order spatial derivative. However, 
similar conclusions may be drawn; see Section 24.5. 


24.3 Down-and-out call example 


To illustrate the flexibility of finite difference methods, we turn to the down-and- 
out call defined in Section 19.2. We know that the PDE holds for B < S. Hene, awe 
may truncate this to B < S$ < L and use a grid of the form {B + jh, ik) 3 a a 
where h = (L — B)/N,. The FTCS scheme (24.6) becomes 


vray (vi - 2v + vi i) 
= — 50°(B + jh)? + 
Vi- Vi l 
—r(B + jh) (He i ') +rvi =0 
and the corresponding BTCS version is 
yitt_ yi (viti -2vit! + viti) 
j 7 e ES 5 


=f (B + jh) (a) +rVit = 0. 
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As before, these may be written in the matrix—vector forms (23.9) and (23.11), and 
the Crank—Nicolson method is given by (24.8). 

The t=O condition (19.2) specifies V? = = max(B + jh — E, 0) and the 
left-hand boundary condition (19.1) gives Vo! = 0. At the right-hand boundary, 
a reasonable approach is to argue that, since S is large, the asset is very unlikely to 
hit the out barrier, so VN, = C(L, t) may be imposed, where C (S, t) denotes the 
European call value. 


Computational example For the case B = 2, E = 4, ø = 0.3, r = 0.03 and 
T = 1 we used Crank—Nicolson to value a down-and-out call. In this case the 


exact solution (19.3) may be used to check the error. With the asset domain 
truncated at L = 10, and with Ny = N; = 50, we found the maximum time-zero 
error (24.9) to be err? = 1.1 x 1073. © 


24.4 Binomial method as finite differences 


Looking back to Chapter 16, we see some similarities between the binomial and 
finite difference methods: 


e both work with discretizations of the time and asset domains, 
e both advance in the time direction, 
e both are designed to be more accurate as the discretization is refined. 


The binomial method works in backward time — starting with option values at 
t = T and finishing with a value at t = 0 and S = So. The finite difference meth- 
ods are more general, in that they produce option values at all grid-points { jh, ik}; 
in particular, at time zero, option values are available for all initial asset prices 
0,h,2h,..., L. Nevertheless, it should seem plausible that the binomial method 
may be regarded as some explicit finite difference scheme that has been customized 
to produce a single time-zero option value. In this section we explain how the con- 
nection can be made concrete. 

Starting with (8.15), we make the transformation X = log S, which produces 
the constant coefficient PDE 


We then let V = e’’W. This has the effect of eliminating the zeroth derivative 
term, to give 


aw, 28°W 
+ 30 3X2 


aw 
re jo) — 0, (24.10) 
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see Exercise 24.4. 
Now, applying a backward difference formula for the time derivative and central 
differences for the space derivatives in (24.10) leads to the finite difference formula 


i+1 j i+] i+1 i+] 
w -Wi 1_2 Wigi- 22W; PWG 
e n2 
wit! — ae 
r= o> ( ET) ao a 


Setting A? = o7k has the effect of eliminating the wit! terms in (24.11), and the 
formula then reduces to 


wi = pwit t+ - p*Witi, (24.12) 


where p* = 5(1 + JSk(r/o — o/2)). Transforming back to V we find that 
Vise (pvit + a — pvitt). (24.13) 


Comparing (24.13) and (16.3), we see that the binomial method corresponds 
to using an explicit finite difference method on a transformed version of the 
Black-Scholes PDE. The finite differences are applied on a sub-grid, as illus- 
trated in Figure 24.1. The coupling h? = øo?k puts the method on the very cusp 
of von Neumann instability, see Exercise 24.5, which explains the undesirable but 
noncatastrophic oscillations observed in Section 16.4. 


24.5 Notes and references 


As we mentioned in Chapter 23, it is possible to convert the Black-Scholes PDE 
for European calls and puts into the heat equation form (23.2). Hence, it is perfectly 
reasonable to convert to that form before applying a finite difference method. We 
showed how to work directly with the Black-Scholes version (in reverse time) 
because in the case of more complicated options such a transformation may not 
be possible. We chose to discretize the spatial first derivative dV /0S in (24.1) by 
a central difference. An alternative that is better in the case where the volatility 
is very small is upwind differencing; see (Iserles, 1996; Mitchell and Griffiths, 
1980; Morton and Mayers, 1994; Strikwerda, 1989). 

The texts (Clewlow and Strickland, 1998; Kwok, 1998; Wilmott, 1998; Wilmott 
et al., 1995; Seydel, 2002) are good sources for more details about the application 
of finite differences to option valuation. 

We saw in Chapter 18 that the problem of valuing an American option can be 
couched in terms of a linear complementarity problem. It is possible to develop 
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Fig. 24.1. An example of a finite difference grid { jh, ik} (0 ;_0: Crosses mark 
points used by the binomial method (24.13) to obtain a single time-zero option 
value. 


finite difference schemes for such problems; see (Wilmott et al., 1995), for exam- 
ple. A promising, but often overlooked, alternative is to use a penalty method. In- 
deed, the basic binomial method of Chapter 18 is an example of a simple, explicit 
penalty method. More accurate versions are developed and analysed in (Forsyth 
and Vetzal, 2002). Our illustration in Section 24.4 of the connection between bi- 
nomial and finite difference methods was based on Appendix C of (Forsyth and 
Vetzal, 2002). A fuller treatment of this topic can be found in (Kwok, 1998). 

It is worth making the point that the development and implementation of nu- 
merical methods for PDEs is an area where a beginner is generally best advised to 
make use of existing technology: ‘off the shelf’ is preferable to ‘roll your own’. 
However, a basic understanding of the nature of simple numerical methods, at the 
level of these last two chapters, gives a good feel for what to expect from PDE 
solvers. 

MATLAB comes with a fairly simple built-in PDE solver, pdepe, and may 
be augmented with a PDE toolbox. Generally, there is an abundance of nu- 
merical PDE software available, both commercially and in the public domain. 
Good places to start are the Netlib Repository www.netlib.org/liblist.html and 
the Differential Equations and Related Topics page http://www.maths.dundee. 
ac.uk/software/index.html#DEs maintained by David Griffiths at the University 
of Dundee. 
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%CH24 Program for Chapter 24 
% 
% Crank-Nicolson for a European put 


clf 


VVVV%V%% Problem and method parameters %%%%%%% 
E = 4; sigma = 0.3; r = 0.03; T = 1; 

L = 10; Nx = 50; Nt = 50; k = T/Nt; h = L/Nx; 

Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo Yo oo PP P P P PA 


T1 = diag(ones(Nx-2,1),1) - diag(ones(Nx-2,1),-1); 

T2 = -2*eye(Nx-1,Nx-1) + diag(ones(Nx-2,1),1) + diag(ones(Nx-2,1),-1); 
mvec = [1:Nx-1]; 

D1 = diag(mvec); 

D2 = diag(mvec.°2); 

F = (1-r*k)*eye(Nx-1,Nx-1) + 0.5*k*sigma’2*D2*T2 + 0.5*k*r*D1*T1; 

B = (14r*k)*eye(Nx-1,Nx-1) - 0.5*k*sigma’2*D2*T2 - 0.5*k*r*D1*T1; 
A1 = 0.5*(eye(Nx-1,Nx-1) + F); 

A2 = 0.5*(eye(Nx-1,Nx-1) + B); 


U = zeros(Nx-1,Nt+1); 
U(:,1) = max(E-[h:h:L-h]’,0); 


for i = 1:Nt 
tau = (i-1)*k; 
p1 = k*(0.5*sigma’2 - 0.5*r)*E*exp(-r*(tau)); 
q1 = k*(0.5*sigma’2 - 0.5*r)*E*exp(-r*(tau+k)); 
rhs = A1*U(:,i) + [0.5*(p1+q1); zeros(Nx-2,1)]; 


X = A2\rhs; 
U(:,i+1) = X; 
end 


bca = E*exp(-r*[0:k:T]); 

bcb = zeros(1,Nt+1); 

U = [bca;U;bcb]; 

mesh([0:k:T],[0:h:L],U) 

xlabel(’T-t’), ylabel(’S’), zlabel(’Put Value’) 


Fig. 24.2. Program of Chapter 24: ch24.m. 


24.1. 


24.2. 


24.3. 


24.4. 


24.5. 
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EXERCISES 


x Confirm that FTCS in (24.6) and BTCS in (24.7) have matrix-vector 
forms (23.9) and (23.11), respectively, as indicated in Section 24.2. 

xx In the case of a European call option, point out a contradiction in the 
initial and boundary conditions (24.2) and (24.4). How could this be over- 
come? 

xx Write the FTCS, BTCS and Crank—Nicolson methods for a down-and- 
out call option in matrix—vector form. 

x x x Confirm that the transformations given in Section 24.4 convert (8.15) 
to (24.10). 

xx Suppose that a constant diffusion coefficient, 50°, is introduced into the 
heat equation (23.2) to give 


The FTCS method would then use 
—1 i _ lp—2ş2yyri 


Show that the von Neumann stability condition takes the form o?°k < h?. 


24.6 Program of Chapter 24 and walkthrough 


Our program ch24 implements Crank—Nicolson, (24.8), for a European put, producing a picture like 


that in Figure 11.4. It is listed in Figure 24.2. The structure of the code is similar to ch23, and the 


commands used have been explained in previous chapters. 


PROGRAMMING EXERCISES 


P24.1. Alter ch24 so that it values a down-and-out call option. 

P24.2. Investigate the use of MATLAB’s built-in PDE solver pdepe for option 
valuation. Type help pdepe or consult (Higham and Higham, 2000, Section 12.4) 
for details of how to use pdepe. 


Quote 


...one reason I’ve found financial engineering so exciting 

is that banks pay attention to a lot of academic work. 

In that sense, it’s a very aggressive area, 

because if you have a new method for solving a problem of interest, 
there will be listeners. 

And they’ll come back, ask questions, be on the phone, 

and fill the seminar room. 


TOM COLEMAN, Financial Engineering News, September/October 2002 
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